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Abstract 

A Besicovitch set is a set which contains a unit line segment in any direc- 
tion. It is known that the Minkowski and Hausdorff dimensions of such a set 
must be greater than or equal to 5/2 in HL 3 . In this paper we show that the 
Minkowski dimension must in fact be greater than 5/2 + e for some absolute 
constant e > 0. One observation arising from the argument is that Besicovitch 
sets of near- minimal dimension have to satisfy certain strong properties, which 
we call "stickiness," "planiness," and "graininess." 

The purpose of this paper is to improve upon the known bounds for the 
Minkowski dimension of Besicovitch sets in three dimensions. As a by-product 
of the argument we obtain some strong conclusions on the structure of Besi- 
covitch sets with almost-minimal Minkowski dimension. 

Definition 0.1. A Besicovitch set (or "Kakeya set") E C W 1 is a set 
which contains a unit line segment in every direction. 

Informally, the Kakeya conjecture states that all Besicovitch sets in W 1 
have full dimension; this conjecture has been verified for n = 2 but is open 
otherwise. For the purposes of this paper we shall restrict ourselves to the 
Minkowski dimension, which we now define. 

Definition 0.2. If E is in M n , we define the (5-entropy £s(E) of E to be 
the cardinality of the largest <5-separated subset of E, and N$(E) to be the 
^-neighbourhood of E. 

Definition 0.3. For any set E C M n , the (upper) Minkowski dimension 
dim(E') is defined as 

dim(E) = limsuplog 1 / 5 £s(E) = n — liminf log^ \Ng(E)\. 
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In R n , Wolff [17] showed the estimate 



(1) dim(E) > -n + 1, 

while Bourgain [4] has shown 

13 12 



dim(£)>-n +25 . 
The latter result has recently been improved in [8] to 



(2) dim (£)>_ n + _ 

For further results, generalizations, and applications see [19]. 

When n = 3 Wolff's bound is superior, and thus the best previous result 
on the three-dimensional problem was dim(E) > 5/2. By combining the ideas 
of Wolff and Bourgain with some observations on the structure of (hypothet- 
ical) extremal counterexamples to the Kakeya problem, we have obtained the 
following improvement, which is the main result of this paper. 



Theorem 0.4. There exists an e > such that dim(.E) > 5/2 + e for all 
Besicovitch sets E in R 3 . 

While the epsilon in this theorem could in principle be computed, we have 
not tried to optimize our arguments in order to produce an efficient value for e. 
The argument in this paper certainly works for e = 10~ 10 , but this is definitely 
far from best possible. 

Broadly speaking, the argument is a proof by contradiction. A hypothet- 
ical counterexample to the theorem is assumed to exist for some small e. By 
refining the collection of tubes slightly, and at one point passing from scale 5 
to scale p = VS, one can impose a surprisingly large amount of structure on 
"most" of the Besicovitch set. Eventually there will be enough structure that 
one can apply the techniques of Bourgain [4] efficiently and obtain a contradic- 
tion. (Of course, if one applies these techniques directly then one would only 
obtain (2), which is inferior for n = 3.) 

By the term "most" used in the previous paragraph, we roughly mean 
that the portion of the Besicovitch set for which our structural assumptions 
fail only occupies an extremely small fraction of the entire set; we will make 
this notion precise in Section 5. We shall need this very strong control on the 
exceptional set, as there is a key stage in the argument in which we need to 
find an arithmetic progressions of length three in the nonexceptional portion 
of the Besicovitch set. A discussion of the difficulties of this approach when 
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one only knows that a small portion of each tube is "good" can be found in 
Bourgain [4]. This means that we will not use methods such as pigeonholing 
to obtain structural assumptions on our set, as these types of methods usually 
only give a nonexceptional set which is about log(l/(5) _1 of the full Besicovitch 
set. 

It may well be that one can use further ideas in [4], such as using triples of 
points whose reciprocals are in arithmetic progression, in order to circumvent 
this restriction. However, there is an additional obstruction preventing us 
from obtaining improvements to Theorem 0.4 such as a Hausdorff dimension 
or maximal function result, as in [17] or [4]. Namely, our argument crucially 
requires control of the entropy of the Besicovitch set not only at scale 5, but 
also at many intermediate scales between 5 and 1. In particular, the scale 
p = y/d plays a key role. Such control is readily available in the case when the 
upper Minkowski dimension is assumed to be small, but not in the other cases 
just discussed. 

We will derive structural properties on our hypothetical low-dimensional 
Besicovitch set in the following order. Firstly, we follow an observation in Wolff 
[18] and observe that the Besicovitch set must be "sticky," which roughly states 
that the map from directions to line segments in the Besicovitch set is almost 
Lipschitz. To make this observation rigorous we require the X-ray estimate in 
[18], and also rely crucially on the fact mentioned earlier, that we have control 
of the Besicovitch set at multiple scales. We will achieve stickiness in Section 3. 

Once we have obtained stickiness, it is a fairly routine matter to show that 
the Besicovitch set must behave in a self-similar fashion. For instance, if one 
takes a p-tube centered around one of the line segments in the Besicovitch set, 
and dilates it by around its axis, one should obtain a new Besicovitch set 
with almost identical properties. We will obtain quantitative versions of these 
heuristics in Section 6, after some preliminaries in Sections 4 and 5. 

We then combine these self-similarity properties with the following geo- 
metric heuristic: if for i = 1, 2, 3, we have a vector V{ and a family of tubes Tj 
which all approximately point in the direction Vi, then the triple intersection 



will be fairly small unless v±, V2, V3 are almost co-planar. As a consequence we 
be able to conclude a remarkable structural property on the Besicovitch set, 
which we call "planiness." Roughly speaking, it asserts that for most points 
x in the Besicovitch set, most of the line segments passing through x lie on a 
plane tt(x), or on the union of a small number of planes. At first we shall only 
derive this property at scale p, for reasons which shall become clear, but by 
changing scale we may easily impose this property at scale 5 as well. 



(3) 
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One can analyze the derivation of planiness further, and obtain an impor- 
tant additional property which we call "graininess." Roughly speaking, this 
asserts that the intersection of the Besicovitch set with any p-cube will, when 
studied at scale 5, look like a union of 5 x p x p boxes which are parallel 
to the plane n(x) mentioned earlier. This property is obtained by repeating 
the derivation of planiness, but with the additional observation that even if 
vi,V2, vs lie in a common plane tt(x), the set (3) will still be small unless the 
sets UreTi ^ are essentially of the form just described, assuming that the angles 
between v±, i>2, and i>3 are fairly large. 

We will define the properties of planiness and graininess rigorously in Sec- 
tion 8, after some preliminaries in Section 7. The derivation of these properties 
shall be the most technical part of the paper, requiring an increasingly involved 
sequence of definitions, but once these properties are obtained, the argument 
will become technically much simpler (though still somewhat lengthy). 

We remark that the arguments up to this point are not necessarily re- 
stricted to sets of dimension close to 5/2, although for other dimensions one 
must find an analogue of Wolff's X-ray estimate [18] to begin the argument. 
The argument below, however, is only effective near dimension 5/2. 

Next, we follow the philosophy of Bourgain [4] and find three p-cubes Qo, 
Qi, Q2 in arithmetic progression which each satisfy certain good properties. 
To do this it is important that all the properties attained up to this point occur 
on a very large fraction of the set, which unfortunately causes the arguments 
in previous sections to be somewhat involved. However, once we have obtained 
the arithmetic progression then one can be far less stringent, and deal with 
properties that are satisfied fairly sparsely. 

We now apply the ideas in [4] , which we now pause to recall. Very roughly, 
the argument in [4] for the Minkowski dimension runs as follows. Let A, B, C 
be the intersections of the Besicovitch set with three planes in arithmetic pro- 
gression. As the Besicovitch set contains many line segments through A, B, C, 
it follows that there are many pairs (a, b) of points in A x B whose midpoint 
is in C. In fact, if the dimension of the Besicovitch set is close to (n + l)/2, 
then a large fraction of A x B will have this property. Schematically, we may 
write this property as 

(A + B) C 2C. 

In particular, since we expect A, B, C to be of comparable size, we should 
have 

(4) \A + B\ » \A\ 

where we need to discretize A, B at some scale 8 to make sense of the above 
expressions. From the combinatorial lemmas in [4] relating sums to differences, 
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we should therefore have (in an appropriate sense) 



(5) 



A-B\<& \A\\B 



But this implies that many of the line segments connecting A with B are 
parallel, which contradicts the definition of a Besicovitch set. Hence we cannot 
have a Besicovitch set with dimension close to (n + l)/2. 

Suppose we applied the above ideas to our situation. If Qo, Qi, and Q2 
are three p-cubes in arithmetic progression, the Besicovitch set property should 
imply that there are many pairs (a, b) of points in Qq x Q2 whose midpoint is 
in Q\. Unfortunately, if our set has dimension close to 5/2, this set of pairs of 
points is very sparse compared to Qq x Q2, and the combinatorial lemmas in 
[4] are not effective in this context. 

However, this can be salvaged by using the planiness and graininess prop- 
erties of our set, and especially the fact that the planes through x are almost 
always parallel to the squares through x. These very restrictive properties 
drastically reduce the possible degrees of freedom of the Besicovitch set, and 
many pairs in Qo x Qi can be ruled out a priori as being of the form above. 
The end result is that one can find a well-behaved subset G of Qq x Q2 which 
is determined by the planes and grains, such that the midpoint of a and b is 
in Q\ for a large fraction of pairs (a, b) in G. By applying the lemmas of [4] 
we find once again that the Besicovitch set contains many tubes which are 
parallel, if e is sufficiently small. This is our desired contradiction, and we are 
done. 

For completeness we also give in an Appendix a sketch of the argument 
from [4] that we use. 

The properties of planiness and graininess may seem strange, but there is 
a simple example of an object which resembles a 5/2-dimensional Kakeya set 
- albeit in C 3 instead of M 3 — and which does obey these properties. Namely, 
the Heisenberg group 



has real dimension 5, contains a four-parameter family of lines (some of which, 
though, are parallel), and satisfies the planiness and graininess properties per- 
fectly. We will discuss this example further in Section 13. 

The authors are indebted to Jean Bourgain for explaining his recent work 
to one of the authors, the referee for helpful comments, and especially to Tom 
Wolff for his constant encouragement and mathematical generosity. Many 
of the "new" ideas in this paper were inspired, directly or indirectly, by the 
(mostly unpublished) heuristics, computations, and insights which Tom Wolff 
kindly shared with the authors. The first and third authors are supported by 
NSF grants DMS-9801410 and DMS-9706764 respectively. 



{{zi,z 2 ,z 3 ) G C 3 : 



Im(z 3 ) = lm(z 1 z 2 )}. 
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1. Notation and preliminaries I 

Throughout this paper we shall always be working in three dimensions R 3 
unless otherwise specified. We use italic letters x,y,z to denote points in R 3 , 
and Roman letters (x, y, z) to denote coordinates of points in M 3 . 

Unless otherwise specified, all integrals will be over M 3 with Lebesgue 
measure. 

In this paper 5 refers to a number such that 0<5<Cl, and e refers to a 
fixed number such that < e <C 1. d will be a number such that 2 < d < 3; 
later on we will set d = 5/2. The symbol p will always denote the quantity 
p = 

We use C, c to denote generic positive constants, varying from line to line 
(unless subscripted), which are independent of e, 5, but which may depend on 
the parameter d. C will denote the large constants and c will denote the small 
constants. 

We will use X < Y, Y > X, or X = 0(Y) to denote the inequality 
X < AY, where A is a positive quantity which may depend on e. We use 
I » 7 to denote the statement X > AY for a large constant A. We use 
X ~ Y to denote the statement that X < Y and Y < X. 

We will use X < Y, Y > X, or "Y majorizes X 11 to denote the inequality 

X < AS' Ce Y, 

where A is a positive quantity which may depend on e, and C is a quantity 
which does not depend on e. We use X ps Y to denote the statement that 
X < Y and Y < X. In particular we have e ~ 1. 

If E is a subset of M ra , we use \E\ to denote its Lebesgue measure; if / is 
a finite set, we use to denote its cardinality. 

For technical reasons, we will require a nonstandard definition of a <5-tube. 
Namely, a 5-tube T is a ^-neighbourhood of a line segment whose endpoints xq 
and x\ are on the planes {(x, y, z) : z = 0} and {(x, y, z) : z = 1} respectively, 
and whose orientation is within of the vertical. Note that 

(6) |T CT |~a 2 

for any cr-tube T a . We define a direction to be any quantity of the form (x, y, 1) 
with |x|, |y| < 1, and dir(T) to be the direction x\ — xq. 

If T is a tube, we define CT to be the dilate of T about its axis by a factor 
C. We say that two tubes T and T' are equivalent if T C CT' and T' C CT. 
If T is a set of tubes, we say that T consists of essentially distinct tubes if for 
any T 6 T there are at most O(l) tubes T 1 which are equivalent to T. 

We use the term r-ball to denote a ball of radius r, and use B(x, r) to 
denote the r-ball centered at x. By a simple covering argument using 5-balls, 
we see that the quantities £s(E) and Ng(E) defined in Definition 0.2 are related 



MINKOWSKI DIMENSION OF BESICOVITCH SETS IN K 3 



389 



in R 3 by the basic estimate 

(7) 5 3 £s(E) ~\Ns(E)\. 

If n is a function, we use supp(/u) to denote the support of fi. 

If 1 < d < oo is an exponent, we define the dual exponent by d! = dj (d— 1), 
with oo' = 1. The support, L 1 norm, L d norm, and size of a function are all 
related of course by such standard inequalities as Holder and Chebyshev. We 
will rely on these inequalities extremely often, and so we write them down for 
future reference. 

Lemma 1.1. For any nonnegative function n on a measure space (continuous 
or discrete), any 1 < d < oo, and any A > 0, we have 

(8) HHIi £ ||HU|supp(//)| 1/d , 

(9) \\n\\df £ ||fi||i|supp(/i)|- 1 / d , 

(10) / H £ A^'lMll 



(11) / n < A|supp( M )|, 

(12) \{^\}\ & x~ d 'Mi. 



We remark that all the above quantities will automatically be finite in our 
applications. 

Finally, we observe the following trivial uniformity lemma. 

Lemma 1.2. Let fx be a nonnegative function on a measure space (continuous 
or discrete), such that 

|supp(»| < A, 

HHIoc - B, 
||/x||i > AB 

for some A, B > 0. Then there exists a nonempty set E C supp(/x) such that 

\E\ ps |supp(/i)| ps A, 

/j, ps B on E, 
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Proof. Define A by 



X = (AB)" 1 \\ii\\ 1 ; 



from hypothesis we have Awl. 



We define E to be the set 




From the estimate 



XAB = M L i(E) + IImIIli(bc) < B\E\ + -XBA 



which follows from the hypotheses, we see that \E\ <; A. The verification of 



It is of course possible to make this lemma more precise (e.g. by the 
pigeonhole principle), but we shall not do so here. 



In this section we summarize the Kakeya and X-ray estimates which we 
shall need. In the following a, 6 are quantities such that 5 < a < < 1. 

Definition 2.1. If TV is a collection of cr-tubes, we define the directional 
multiplicity m = m(T a ) to be the largest number of tubes in T a whose direc- 
tions all lie in a cap of radius a. If m w 1, we say that T CT is direction- separated. 

Definition 2.2. Let 2 < d < 3. We say that there is an X-ray estimate 
at dimension d if there exist < a, (3 < 1 for which the following statement 
holds: For any <5-separated set £ of directions and any collection T of essentially 
distinct tubes pointing in directions in £, 



where m is the directional multiplicity of T. If we only assume that (13) holds 
with < a < 1 and (3 = 0, then we say that there is a Kakeya estimate at 
dimension d. 

Clearly, an X-ray estimate is stronger than a Kakeya estimate at the same 
dimension. 

Theorem 2.3 ([17], [18]). There is a X-ray estimate at dimension 5/2. 

Indeed, (13) is proven in [17] for (a, (3, d) = (7/10, 0, 5/2), while in [18] this 
is improved to (a,/3, d) = (7/10, 1/4,5/2). Although these values of a and /3 
are sharp for this value of d, their exact values are not particularly important 



the remainder of the properties are now routine. 



□ 



2. Kakeya estimates 



(13) 
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for our purposes. We remark that an estimate with a = can automati- 
cally be improved to an estimate for some positive a thanks to Nikishin-Pisier 
factorization theory; a discussion of this phenomenon may be found in [1]. 
We will usually only rely on the following variants of the above estimate: 

Lemma 2.4. Let 6 < a < 9 <C 1, and let T a be a collection of a-tubes 
whose set of directions all lie in a cap of radius 9. Let 2 < d < 3 be fixed. 

• If we have a Kakeya estimate at some dimension d, and if the collection 
TV is direction- separated, then 



(14) 



T„eT a 



„ ±± - d + 1 
< a d 9 d , 



If we have an X-ray estimate at some dimension d, and if T CT consists of 
essentially distinct tubes, then 



(15) 



T CT eT CT 



< 



d-3 d+1 



a d 



d m 



1-/3 



for some > 0, where m is the directional multiplicity ofT a . 

Proof. We prove only the second claim, as the first follows by setting m ~ 1 
and (3 = 0. By an affine transformation we may assume that u = (0,0,1). 
Apply the nonisotropic dilation (x, y, z) (# _1 x, 9~ 1 y, z). This transforms T CT 
to a collection T' of a9~ l tubes pointing in a cr^ _1 -separated set of directions, 
without significantly affecting the directional multiplicity. From (13) we have 



J2 *t> 

TGT' 



, d-3 -3-d ,_a 

<ad0dm ip . 



The claim then follows by undoing the dilation. 



□ 



In the specific case d = 5/2, a = 7/10 we can also obtain the above lemma 
directly from (13). 



3. The sticky reduction 



In the rest of the paper, 2 < d < 3 will be a number such that there is 
an X-ray estimate at dimension d. In particular, by the results in [18] we may 
choose d = 5/2. 

It is well known that Besicovitch sets must have Hausdorff and Minkowski 
dimensions > d. The purpose of this section is to show that one can push this 
observation a bit further, and conclude that sets whose Minkowski dimension 
is close to d have a certain "sticky" structure. Our arguments crucially rely 
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on the fact that sets with small Minkowski dimension are under control at 
several scales simultaneously (in particular, at the scales 6 and p = y/S); there 
does not appear to be any obvious way to apply these heuristics to a set with 
Hausdorff dimension close to d, for instance. 

Definition 3.1. Let T be a collection of 5-tubes. We say that T is sticky 
at scale 5, or just sticky for short, if it is direction-separated and there exists a 
collection T p of direction-separated /9-tubes and a partition of T into disjoint 
sets T[T p ] for T p G T p such that 

(16) T C T p for all T p G T p and T G T[T p ] 
and we have the cardinality estimates 

(17) #T « <T 2 , 

(18) #T p « <r\ 

(19) #T[T p ] « (T 1 for all T p G T p . 
We call T p the collection of parent tubes of T. 

For technical reasons we shall also need an iterated version of stickiness: 

Definition 3.2. Let T be a collection of direction-separated 5-tubes of 
cardinality rs 5~ 2 . We say that T is doubly sticky if it is sticky at scale 5, and 
its collection T p of parent tubes is sticky at scale p. 



Proposition 3.3. Suppose that there is an X-ray estimate at dimen- 
sion d, and that there exists a Besicovitch set E with d\m(E) < d + e. Then 
for any sufficiently small 5, there exists a doubly sticky collection T of tubes at 
scale 5 with parent collection T p and grandparent collection T S i/ 4 , such that 



(20) 
(21) 

(22) 



| U T | « * 3_d . 

TeT 

U t p\ & p 3 ~ d > 

TpGTp 



U TV/4 



Proof. Let E be a Besicovitch set with Minkowski dimension at most d+e, 
and let 5 <C 1 be a fixed. We may assume without loss of generality that E is 
contained in a fixed ball B(0, C). Then by Definition 0.3 we have 

(23) \N a (E)\ <<7 3 - d for all <5 < <r < 1. 
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By taking a <5-separated set of directions oriented within of the vertical, 
and looking at the associated line segments in E, we can find (possibly after 
rescaling E slightly) a direction-separated set T of (5-tubes satisfying (17) such 
that each tube is contained in Nq§(E). Unfortunately this collection need 
not be sticky, let alone doubly sticky. To remedy this we shall prune T of its 
nonsticky components. 

Let £ be a maximal p-separated set of directions. Call a direction u E £ 
sticky if the tubes T E T such that dir(T) E B(uj, p) can be covered by 0(5~ Cie ) 
p-tubes pointing in the direction uj; here C\ is a constant to be chosen later. 




Figure 1. An example of a nonsticky direction uj. The thin tubes 
are (5-tubes in Ncs(E), the fat tubes are p-tubes in Nc p {E). Note 
that Nc p {E) is rather large. 




Figure 2. An example of a sticky direction uj. 

We now observe that only a small number of directions uj are nonsticky. 
More precisely, let £\ be the subcollection of directions in £ which are nonsticky. 
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By definition, we can find for each uj G S\ a collection of > 5 Cl£ disjoint p- 
tubes which are contained in Nq p (E). Call the union of all these collections 
T' p . Then by (6) 



E XT 

TeT' 



p 2 #t; > r Cie (^ 2 #fi). 



From (9) and (23) we thus have 



TeT' 



E xt ><r^v#£i)p-^ 



On the other hand, from (13) we have 



E 

TeT' 



XT 



< 



■/») 



(p 2 #£i 



Combining the two estimates we obtain 

where c depends on a, (3. Since the tubes in T are direction-separated, we thus 
see that at most £ 5~ 2 5 cCie tubes in T that point within 0(5) of a nonsticky 
direction. We may therefore remove these tubes from T without significantly 
affecting (17), if C\ was chosen sufficiently large. 

The collection T now has no nonsticky directions. Thus, we may cover 
the tubes in T by a family of p-tubes T p which are direction-separated. In 
particular, we have the upper bound in (18). Since T is direction-separated, 
each Tp can cover at most ~ <5 _1 tubes in T, and so we have the lower bound 
in (18). Let {T[T p ]} be any partition of T for which (16) always holds. As we 
have just observed, 

#t[t p ] < r 1 

for each T p E T p . On the other hand, from (17) we see that 



E # T iT P 



C-2 



TpGT p 



From these estimates, (18), and Lemma 1.2, we can find a subset Tj of T f 



such that #TL 



and (19) holds for all T p G T p . If we now replace T 



with UTpeTp' ^[Tp] and T p with T p we see that we have obtained (19) without 
significantly affecting any of the other properties just derived. 

We now repeat the above procedure but with 5 replaced by p, and T 
replaced by T p . This allows us to refine the collection T p so that it is also 
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sticky, with an associated collection T 5 i/ 4 of 5 1//4 -tubes. Of course, to maintain 
consistency we have to remove the tubes T[T p ] from T every time we remove 
a tube T p from T p , but this does not cause any difficulty, and one may verify 
that all the claims in the proposition hold. Note that the claims (20), (21), 
and (22) will follow from (23). □ 

Henceforth we assume that C\ has been set to an absolute constant, and 
allow all future constants to implicitly depend on C\. We will continue this 
convention when we choose C2, C3, etc. 

The above result established stickiness at the scales p and J 1 / 4 . In fact, 
one could quite easily establish stickiness at every scale 5 < a < 1 from the 
Minkowski dimension hypothesis, but we shall not need to do so here. We 
also remark that the above argument requires no special numerology on d, and 
would work perfectly well for d > 5/2, providing of course that we had an 
X-ray estimate at d. 

For most of the argument we shall not need double stickiness, and derive 
most of our results just by assuming stickiness. In fact, the only place we 
shall need double stickiness is in Proposition 9.2, where we need to move the 
planyness property, which is initially derived at scale p, to the finer scale of S 
without losing the stickiness property. 

4. Notation and preliminaries II 

In the rest of the paper, < e <C 1 will be a fixed small number (say 
e = 10~ 10 ), and T will be a sticky collection of tubes satisfying (20) and (21). 
With the exception of Proposition 9.2, we will not use the double-stickiness 
property, and will not change the value of 5. 

For future reference we shall set out some notation and estimates which 
we shall use frequently. We use T a to denote a tube of thickness a; if T is 
unsubscripted, we assume it to have thickness 5. 

Definition 4.1. For any x € M 3 and T p G T p , we define the sets T(x), 
T p (x), andT[T p ](x) by 

T(x) = {T e T : x e T}, 
T p (x) = {T p G T p : x G T p }, 
T[T p ]{x) = {T G T[T p ] : x € T}. 
Definition 4.2. We define the sets E$, E p , and Es[T p ] for all T p G T p by 

E S = \JT, 

TeT 
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E P = (J T p< 

T p eT p 

E S [T„] = (J T. 

TeT[T p ] 

We similarly define the multiplicity functions fi$, \i p , and /A5[T p ] by 



(*) = Ext(x) = #t(x), 



TeT 



Vp( x ) = XT p (x) = #T p (x), 

M[T P ](x) = J2 XT p (x) = #T[T P ](x). 

TeT[T p ] 

The size of Eg and E p is controlled by (20) and (21), while the L d ' norms 
of fig, fJ-p, and //j[r p ] can be controlled by (14). To complement these bounds 
we have the following precise control on the L 1 norms of /j, s d, and ^s\T p ]: 

Lemma 4.3. Let T p be an element ofT p , and xq be a point in T p . Then 
we have the L 1 estimates 



(24) Wfish ~ 1, 

(25) H/Xpll! « 1, 

(26) \\ns[T P ]\\i ~ 6, 

(27) \\MT p ]\\ LHB{xo , Cp) ) « S 3 / 2 . 



Proof. The estimates (24), (25), (26) follow from (17), (18), (19), and 
(6). The proof of (27) is similar and relies on the geometrical observation (cf. 
Figure 3) 



(28) / xt ~ £ 5/2 

JB(x ,Cp) 

for any Te T[T p ]. □ 



5. Properties which occur with probability close to 1 
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In the sequel we shall frequently need a quantitative version of the state- 
ment "The property P(x) implies the property Q(x) with probability close to 
1". This motivates 

Definition 5.1. Let P(x) and Q(x) be logical statements with free parame- 
ters x = (xi, . . . , x n ), where each of the variables Xi ranges either over a subset 
of Euclidean space, or over a discrete set. We use 

(29) 9Q(x) : P(x) 
to denote the statement that 

(30) \{x : Q(x) holds, but P(x) fails}] < 5 cV ~ e \{x : Q(x) holds}] 

for some absolute constant c > 0, where the sets are measured with respect to 
the measure dx = Ylf = i dxi, and dxi is Lebesgue measure if the Xi range over a 
subset of Euclidean space, or counting measure if they range over a discrete set. 

In practice our variables Xi will either be points in M 3 (and thus endowed 
with Lebesgue measure), or tubes in T or T p (and thus endowed with counting 
measure). Thus, for instance, 

V T G T,x € T : P{x,T) 
T, x 

denotes the statement that 

£|{*eT:P(*,T)fails}|<^£|T|. 

TeT TeT 

The right-hand side of (30) will always be automatically finite in our applica- 
tions. Note that (29) vacuously holds if P(x) is never satisfied. 

Of course, (30) is trivial if c = 0. The reason why we choose the factor 
jc^/e - g that it is much smaller than any quantity of the form 5 Ce , but much 
larger than anything of the form 5 C . In particular, (30) implies 

(31) \{x:Q(x),P(x)}\~\{x:Q(x)}\. 

Here and in the rest of the paper, the expression u Q(x), P(x)" is an abbrevia- 
tion for u Q(x) and P(x) both hold." 

The symbol V should be read as "for most x such that," where "most" 

means that the event occurs with probability very close to 1. Observe that the 
meaning of V does not depend on the choice of scale (5, p, or 5 1 / 4 ), except that 

the c in (30) may change value. 

We now develop some technical machinery to manipulate expressions of 
the form (29). This machinery would all be trivial if V were replaced by V, and 
are not particularly difficult to prove. We first observe some trivial lemmas. 
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Lemma 5.2. Suppose that m ~ 1 is an integer and Q and Pi, ... , P m are 
properties depending on some free parameters x±, . . . , x n , such that 

VQ(x) : Pi(x) holds for all i = 1, . . . ,m 

where the implicit constants are independent of i. Then we have 

9Q(x):P 1 (x),...,P m (x). 

Proof. Apply (30) for each Pi(x) and sum in i. □ 

Let Q(x) =>■ P{x) denote the statement u Q(x) fails, or P(x) holds". 

COROLLARY 5.3 (Modus Ponens). If R(x), Q(x), P(x) are properties 
such that 

9R{x) : Q(x) andyR(x) : (Q(x) P(x)) 
hold, then we have 

jR{x) : P(x). 

Lemma 5.4. Let P, Q, R be properties with free parameters x = (x±, . . . , x n ) 
such that 

Vi?(x) : Q{x) and\(R(x),Q(x) : P(x) 

hold. Then we have 

9R(x) :P{x),Q{x). 

Proof. Applying (31) to the first hypothesis, and (30) to the second, we 
obtain 

9R(x) : (Q(x) P(x)), 
and the claim follows from Corollary 5.3. □ 

Lemma 5.5. Let P(x), Q(x, y), R(x, y) be properties with free parameters 
x = (x\ , . . . ,x n ) and y = (yi , . . . , y m ) such that 

VQ(x,y) : R(x,y) 

uniformly for all x satisfying P(x). Then we have V P{x), Q(x, y) : R(x, y). 

x, y 

Proof. Apply (30) to the hypothesis, and integrate over all x that satisfy 
P(x). □ 

Unlike the case with the more familiar V quantifier, some care must be 
taken with V when adding or removing dummy variables if this significantly 
changes the underlying measure. For instance, 



VxG |J T:P( 3 



x 

TeT 
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is not necessarily equivalent to 

V T G T,x G T : P(x) 
T,x 

because the underlying measures are quite different. In the former case all 
points x in IJtst have equal weight, whereas in the latter case the weight of a 
point x is proportional to the multiplicity fi$(x). 

On the other hand, it is legitimate to add or remove dummy variables 
when the multiplicity is approximately constant. More precisely: 

Lemma 5.6. Let Q\{x), Q 2 (x,y) be properties such that 

(32) \{y.Q2(x,y)}\ ~M 

whenever Qi(x) holds for some quantity M independent of x. Then for any 
property P(x), the statements 

VQi(x):P(x) 

and 

\/Q 1 (x),Q 2 (x,y):P(x) 
? y 

are equivalent. 

Proof. From (32) we see that 

\{{x,y) : Q 1 (x),Q 2 (x,y)}\ « M\{x : Q 1 (x)}\ 

and 

\{(x,y) : Qi(x),Q 2 (x,y) hold, P(x) fails}| w M\{x : Q^x) holds, P{x) fails}). 

The claim then follows by expanding the hypothesis and conclusion using (30) . 

□ 

Corollary 5.7. Suppose the set Y is partitioned into disjoint subsets 
Y[z] as z ranges over an index set Z. Then for any properties Q(x, y), P(x, y) 
and any free variables x, the statements 

9yeY,Q(x,y):P(x,y) 
x, y 

and 

V z G Z,y eY[z],Q(x,y) : P(x,y). 

x, y, 4, 

are equivalent. 

Proof. Apply Lemma 5.6 with x, y equal to (x, y), z respectively, M equal 
to 1, and Q 2 ((x, y), z) equal to the property that z G Z and y G Y\z\. □ 

We will usually apply this corollary with Y = T, Z = T p . 
Next, we show how a compound V quantifier can be split up into two 
simpler quantifiers. 
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Lemma 5.8. Suppose that Q(x,y), P(x,y) are properties depending on 
some free parameters x = (xi, . . . , x n ), y = (y±, . . . , y m ). Then the statements 

(33) \f Q(x,y) : P(x,y) 

x, y 

and 

(34) V Q(x,y'):WQ(x,y):P(x,y)] 
x,y y 

are equivalent (up to changes of constants). 

Whenever we have nested V statements as in (34), we always assume the 
implicit constants in the inner V to be independent of the variables in the 
outer V. 

Proof. Let R(x) denote the property that 

VQ(x,y) : P(x,y) 

holds. 

Assume first that (33) held. If R(x) failed, then 

\{y : Q(x,y) holds, P(x,y) fails}| > 5 c '^\{y : Q(x,y)}\ 

for some d which we shall choose later. Integrating this over all x for which 
R(x) failed, we obtain 

\{{x,y) : Q{x,y) holds, P(x,y) fails}] > 5 c '^|{(x,y) : Q(x,y) holds, R(x) fails}]. 

From (33) we therefore have (if c' is chosen sufficiently small) 

\{{x,y) : Q{x,y) holds, R(x) fails}) < 5 c ^\{{x,y) : Q(x,y)}\, 

and (34) follows by replacing y with y' . 

Now suppose that (34) held. We need to show that 

(35) J \{y : Q(x,y) holds, P(x,y) fails}| dx < 5 C ^ J \{y : Q(x,y)}\ dx. 

The contribution to the left-hand side of (35) when R(x) holds is acceptable, 
by the definition of R(x). The contribution when R(x) fails is majorized by 

/ \{y : Q(x,y)}\ = f \{y' : Q(x,y') holds, R(x) fails}) dx 

JR(x) fails J 

and this is acceptable by (34). □ 

Corollary 5.9. Suppose that Q\(x), Q2(x,y), and P(x,y) are prop- 
erties depending on some free parameters x = (x±, . . . , x n ), y = (y±, . . . , y m ) 
which obey (32). Then, the statements 

(36) yQ 1 (x),Q 2 (x,y):P(x,y) 

x ) y 
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ant 



(37) VQ 1 (x):lVQ 2 (x,y):P(x,y)} 
are equivalent (up to changes of constants). 

Proof. Assume that (36) held. By Lemma 5.8 we have 

V Qi(x),Q 2 (x,y') : [V Qi(x), Q 2 (x, y) : P(x, y)]. 

x,y y 

The second Qi(x) is redundant. Eliminating the y' variable using Lemma 
5.6, one obtains (37). The converse implication follows by reversing the above 
steps. □ 

We now apply the above machinery to our specific setting, in which we 
have a sticky collection of tubes. 

Lemma 5.10. Let T be a sticky collection of tubes, and let P(y,T p ,T) be 
a property. Then the statements 



(38) V T p €T p ,T€T[T p ],y€T:P(y,T p ,T), 

T p , T, y 



(39) V T p €T p ,T€T[T p ],y€T,x€T p nB(y,Cp):P(y,T p ,T), 
T p ,T,y,x 

and 

(40) V T p eT p ,x ET p : [9 T eT[Tp],y ETn B(x,Cp) : P(y,Tp,T)} 
T p ,x T,y 

are equivalent (up to changes of constants). 

Proof. The equivalence of (38) and (39) follows from Lemma 5.6, since 
(32) follows from the trivial estimate 

\T p n B(y, Cp)\ w p 3 whenever y G T G T[T p }. 

But if we rearrange (39) as 

V T p eTp,xeTp,TeT[T p ],yeTnB(x,Cp):P(y,Tp,T), 
T p , x, T, y 

then the equivalence of (39) and (40) follows from Corollary 5.9, since (32) 
follows from (27). □ 
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6. Uniformity and self-similarity 

In this section we shall investigate how the stickiness hypothesis, combined 
with the Kakeya estimate at scale d, implies certain self-similarity properties 
of the set E$. We will then show in later sections how these properties imply 
planiness and graininess properties of the set, with these efforts culminating 
in Corollary 9.5. 

The notation will be as in the previous section. Informally, the main result 
of this section shall be 

Heuristic 6.1. The set Eg can be mostly covered by about p~ d p-balls 
B, such that the intersection of Es with most of these balls has volume about 
5 3 ~ d p d . Most points x G Utst^ are contained in about (5~( 3 ~ rf ) tubes T G T, 
which in turn are mostly contained in about p~( 3 ~ d ) families T[T p ] in such a 
way that each family contains p~( 3_d ) of the tubes. Also, the set E$ is not 
"lumpy" at any scale greater than 5, in the sense that 

(41) l^n^l < \B a \ 

for all a S> 5 and all a-balls B a . Finally, the angle subtended by most pairs of 
intersecting tubes is « 1. 

Note that the numerology in this heuristic is consistent with (20), (21), 
(14), and Lemma 4.3. Moreover, it is essentially the only such numerology 
which is consistent with these estimates. 

The main purpose of this section will be to prove a rigorous version of 
Heuristic 6.1; this will be done in Proposition 6.6. Of course, in order to inter- 
pret the term "most" in the above heuristic, the V machinery in the previous 
section will be used heavily. 

We begin with a more accurate version of (41), which may be thought of 
as a dual to the Minkowski dimension estimate (23). 

Proposition 6.2. For each 5 < a <C 1 and x <E R 3 , consider the 
statement 

(42) \E S n B(x, Ca)\ < <T^<5 3 - V. 

Then, if the constants in (42) are chosen appropriately, we have 
T G T, x G T : (42) holds for all5<a<^l. 

Proof. If (42) holds for a, and a' w a, then (42) holds for a' (with slightly 
worse constants). Thus in order to verify (42) for all S < a <S 1, it suffices to 
verify (42) for all a of the form a = 5 ke , where k = 0, . . . , e -1 . Thus it suffices 
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to show that 

V T G T, x G T : (42) holds for all a = 5 ks , k = 0, . . . , e _1 . 
T, x 

Since e _1 « 1, it thus suffices by Lemma 5.2 to show that 

V T G T, x G T : (42) holds for a 
T,x 

uniformly in a. 

Fix a, and let X denote the set 

X = {x G M 3 : (42) fails for a}. 

Then by (30) and (24) it suffices to show 

From (20) and the definition of X we may cover X by a collection B of 
<7-balls with cardinality 

(43) #B < 6^a~ d . 

From elementary geometry we have 

J b Xt£<7- 2 6 2 J b XncAT) 
for all B G B and T G T (cf. (28)). Summing this in B and T we see that 



bgB TeT 



By (8) and (43) this is majorized by 



a- 2 <5v^-ri£xAr CCT(T) iu, 

TeT 

The collection of Ca-tubes A^o-fT) can be partitioned into about <t 2 5 -2 sub- 
collections, each of which are direction-separated. Thus from the triangle in- 
equality and (14), we can majorize the above by 

a- 2 6 2 (a 3 5^a- d )^ d a 2 5- 2 a^ = 5^ d 

as desired. □ 

We combine this property with two others, which are also related to 
Heuristic 6.1. 

Definition 6.3. If xq G M 3 and T p G T p [xo], then we say that Pi(xo,T p ) 
holds if the three statements 

(44) \E S [T P ] n B(x , Cp)\ > 5^5 3 - d p d , 
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(45) V T G T[T p ],x G T n B(x , Cp) : (42) holds for all 6 < a < 1, 
T, x 

(46) V T G T[T p ], x G T n B(x , Cp) : (47) holds 
T, x 

hold, where (47) is the estimate 

(47) ^[T p ]{x)<5-^p-^- d \ 

Proposition 6.4. If the constants in Definition 6.3 are chosen appro- 
priately, then 

V r„€T p ,ar €r p :Pi(xo,2». 

Tp,X 

Proof. If we rearrange Proposition 6.2 as 

V T p G T p , T G T[T p ], x G T : (42) holds for all S < a < 1 
and apply Lemma 5.10, we obtain 

V T p G T p ,x G T p : (45) holds. 
T p ,x 

Next, we show 

V T p G T p ,x G T p : (46) holds. 
T p ,x 

By Lemma 5.10 again, it suffices to show that 

V Tp G Tp, T G T[Tp], x G T : (47) holds. 
Tp , T, x 

By Lemma 5.5 it suffices to show 

V T€T[T p ],x€T: (47) holds 
uniformly in T p . Fixing T p , we rewrite this as 

/ r , h ^s[T p ]<5 c ^ f p s [T p }. 

J^ S [T p ]>S-^ p -(3-d) H J 

However, from (14) we have 

(48) \\p s [Tp]\\ d ,<6 d -^p d -¥. 

The claim then follows (with c = d! — 1) from (10), (26), and some algebra. 
We now show that 

Vx G T p : (44) holds 

Xo 

for all T p G T p ; the proposition then follows from Lemmas 5.5 and 5.2. 
Fix Tp, and let X denote the set 

X = {xo G Tp : (44) fails}. 

By (6), we must show that \X\ < 5 C ^5. 
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If xo £ X, then from (9), (27), and the failure of (44), we have 

llw^lll^^.cp^^^-V)- 1 ^. 

Comparing this with (48) and applying a covering argument one obtains the 
entropy estimate 

£ P (X) < [«5^p^ 1 «5- 3 / 2 («5^ ( 5 3 -V) 1/<i ] d ' 
which simplifies to 

S P {X) <5^ d 'l d 5' 1 ' 2 . 
The desired bound on X then follows from (7). □ 
Definition 6.5. Let xo be a point in M. 3 . We say that ^2(^0) holds if one 

has 

(49) s cv-e p -(s-d) < #r p (x ) < 8- c ^p-^- d \ 

(50) \/T p eT p (x ) : Pi(x ,T p ), 

(51) \E s nB(x ,Cp)\ < 5~ c ^5 3 - d p d , and 

(52) #{T p el p (i„) :dir(T p )G B(u,6)} < ^^-(s-rf) 

for all directions to and all 6 < 9 <C 1. 

Property (52) basically states that the tubes in T p (xq) are not clustered 
in a narrow angular band. 

Proposition 6.6. // the constants in the above definition are chosen 
appropriately, then we have 

(53) VT p eT p ,xeT p :P 2 {x). 

The reader should compare this rigorous proposition with Heuristic 6.1. 

Proof. By Lemma 5.2 it suffices to handle each of the properties in P2(x) 
separately. 

We first observe from Proposition 6.4 and Lemma 5.8 that 



V T p 'eT p ,xeT p ' 



\/T p eT p ,xeT p :P 1 (x,T p ) 



i p ,x J p 
But this is just a rephrasing of 

V T p g T p ,x g T p : (50) holds. 
T p , x 

Next, we show the upper bound in (49): 

V T p G T p ,x G T p : p p {x) < S'^p'^. 
1 p, x 
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We rearrange this using (25) as 



On the other hand, from (14) we have 

(54) W^pT 5 , 

and the claim then follows from (10). 

We now turn to the lower bound in (49): 

V T p G T p , x £ T p : p p {x) > 5 C ^p-^ d \ 

lp,X 

We may rearrange this using (25) as 

(55) / n p < 5 C ^. 

The claim then follows from (11), (21), and the fact that \x p is supported on E p . 
Next, we show 

V T p e T p , x e T p : (52) holds for all directions ui and all 6 < 9 < 1. 

Tp, x 

By repeating the argument in Proposition 6.2, it suffices to obtain 

V T p e T„, x e T p : (52) holds for all directions ui 
T p ,x 

uniformly in 9. 

Fix 9; by (25), it suffices to show that 



where X is the set of points in M 3 for which (52) fails for some u>. 
By (8) and (54) it suffices to show that 



p*r\X\V d <6^, 

or in other words 

\X\ < 5 c ^ P ^ d . 

If (52) fails for some to, it also fails (with slightly different constants) for some 
uj in a fixed c#-separated set. Since this set has cardinality 0(9~ 2 ), it suffices 
to show that 

({a; e R 3 : (52) fails for u}\ < 5 c ^p 3 - d 9 2 

uniformly in uj. 
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Fix u. We must show 



where T p is the set of tubes in T p whose directions lie in B(oo,9). 
By (12) and (14), the left-hand side of (56) is majorized by 

which we rearrange as 

Since d < 3, (56) follows if c is chosen sufficiently small. 

Finally, we address (51). This is easily dealt with because (51) follows 
from the other properties in ^(^o)- To see this, first observe from (49) and 
(50) that Pi(x ,T p ) holds for at least one tube T p £ T p . From (44) and (45) 
we thus see that there exists an x G B(xo,Cp) such that (42) holds for all 
K (7 < 1. Applying this with a = Cp gives the result. (Alternatively, one 
can modify the proof of Proposition 6.2 to obtain (51).) □ 

In the next few sections we will explore some surprising consequences of 
the property P2(xo), namely planiness and graininess. 

7. Triple intersections 

In this section we fix xq to be a point in M 3 such that ^2(^0) holds. 
Let T p be a tube in T p (xq) such that Pi(xq,T p ) holds. By Proposition 6.6 
this situation occurs fairly often. 
Let A(xq,T p ) denote the set 



(56) 



E 



XT, 



(57) A(x ,T p ) = E s [T p ]nB(x ,Cp). 

From (44) we see that 

\A(x ,T p )\ >6 c ^S 3 ~ d p d . 
On the other hand, from (51) we see that 



U A ( x °, T p) 



< s- cv ~ e s 3 - d P d 
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Thus we expect a lot of overlap between the A(xo, T p ). In particular, we expect 
the size of the triple intersection 

(58) A{x , Tl) n A(x , T 2 ) n A(x , T 3 ) 

to be quite large for many triples T l p , i = 1,2, 3. 

However, it turns out that we can get a nontrivial bound on the size of 
(58) if tubes T p are not coplanar. This is basically because (42) forces the 
projections of (58) along the directions of T p to be small. (For a discussion of 
the relationship between the volume of a set and the areas of its projections, 
sec [11].) 

When the tubes T p all lie close to a single plane n, then one cannot directly 
improve the trivial bound on (58) given by (51). However, we can show that 
this trivial bound is only attained when the sets A(xo,T p ) are "grainy," in the 
sense that they consist of a union of squares parallel to n. 

More precisely, we have 

Definition 7.1. Let ir be a plane. We define a square parallel to it to be 
any rectangular box Q of dimensions 5x px p whose two long sides are parallel 

to 7T. 



Lemma 7.2. Let xo be a point in IR 3 such that i^^o) holds, and let 
T p ,Tp,Tp be three tubes in T p (xo). Let Vi be any vectors such that 

(59) v i = pdiT{T i p ) + 0{5). 

Let ir(v\,V2) denote the plane spanned by v\ and V2, 1 < M <C <5 -1 / 4 be an 
arbitrary number, and F be an arbitrary subset o/R 3 . 

• (Nonplanar tubes). If Z.v\,V2 > M -1 , Zw(vi,V2),V3 > pM 2 , and (42) 
holds for all x £ F and S < a < 1, then 

(60) \A(x ,T p ) n A(x , T 2 p ) n A(x , T p ) n F\ < S'^S 3 ' 4 'p d M- c : . 

• (Planar tubes). If Zv±,V2 > M _1 and Zir(v i, ^2), W3 < pM 2 t/ien 



(61) 



lACxo.Tp 1 ) n a(x ,t 2 ) nA(x ,T*) n F| < r c ^-' ! /M c ^up^^^) , 

where Q ranges over all squares parallel to tt(vi,V2). 
Here c > and C > are absolute constants. 
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Figure 3. The shaded region is a portion of a typical A4. Note that 
Ai essentially consists of 5 x p tubes oriented in the direction Vi. 

Proof. Write Ai = A(xo,T l p ) for short. The key observation is that for 
every T £ TT[Tp], the set T n B(xo,Cp) is essentially constant in the direction 
V{. More precisely, we have the elementary pointwise estimate (see Figure 3) 

(62) 

XB(x ,Cp)nT (XB(x ,2Cp)nCT) 

where E v is the averaging operator 

K(f(x)) = I f(x + vt) dt. 
■'1*1^1 

Summing this in T we obtain 

(63) xa^KAXa,) 
where 

Ai = B{x ,2Cp)n |J CT. 

TeT[T;] 

It is easy to see that ~ \Ai\, since Ai is essentially a union of parallel 5 x p 
tubes and Ai is essentially the union of the dilates of these tubes. In particular, 
from (51) we have 

(64) \Ai\< 5- c ^5 3 - d p d . 

To utilize (63) we invoke the following estimate, which is a variant of the 
argument in [11]. 
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Lemma 7.3. Let v±, t> 2 , v 3 be any three linearly independent vectors, and 
let F be a subset o/M 3 . Then for any functions f±, f'2, f 3 , 

1/3 3 



E„ 1 (/ 1 )E„ 2 (/ 2 )E„ 3 (/ 3 ) < sup (^p) fl ll/ills. 



IF 

where P ranges over all parallelopipeds with edge vectors v±, V2, v 3 . 

We remark that the factor supp(^^p) 1 / 3 represents a gain over the trivial 
estimate obtained via Holder's inequality. 

Proof. The statement of the lemma is invariant under affine transforma- 
tions, so we may rescale v±, i> 2 , v 3 to be the cardinal directions e±, e2, e 3 
respectively. It suffices to show that 

3 

/ E„ 1 (/ 1 )E, 2 (/ 2 )E, 3 (/ 3 ) < (iFnPl) 1 / 3 !] \\fi\\ L3(C p), 

for all unit cubes P, since the claim follows by summing over a partition of 
K 3 and using Holder's inequality. We may assume that P is centered at the 
origin, that F C P and that /j are supported on CP. 
We have the pointwise estimate 

K 1 (fi){xi,x 2 ,x 3 ) <J 1 (x 2 ,x 3 ), 

where x = (x\,X2,x 3 ) and 

7i(x2,x 3 )= fi(xi,x 2 ,x 3 ) dX!, 

^|a=i|<l 

similarly for cyclic permutations of 1,2,3. Since \\fi\\ 3 < \\fi\\ 3 by Young's 
inequality, we are thus reduced to showing that 

J p XF{x)J 1 {x2,X 3 )J 2 (x 3 ,Xij] 3 (xi,X 2 ) dxidx 2 dx 3 < | | 1/3 | 1 1 3 I 1 1 3 1 |T^3 1 1 3 - 

We rewrite this as 

/ / Xf( x 2,x 3 )J 1 {x2,x 3 )J1 1 (x 3 )J1 1 (x2) dx 2 dx 3 ) dXi, 

J\X!\<1 \^|X 2 |,|X3|<1 / 

where Xf ( x 2, ^3) = Xf(xi, x 2 , x 3 ), etc. By Holder's inequality this is ma- 
jorized by 

/ WXF'hUihWTs'^hdxu 

J|xi|<l 

where the norms are taken in the x 2 , x 3 variables only. We can simplify this 
to 



ll/ills / Wllsll^llsll^llscfai, 

J\xi\<l 



'\xi\<l 

and the claim follows from applying Holder in the x\ variable. □ 
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Figure 4. The two cases of Lemma 7.2. 
Combining this estimate with (63) and (64) we obtain 



(65) |Ai n A 2 n A 3 n F\ < 5' c ^ e 5 3 - d p d (^sup ■ 

where P ranges over all parallelepipeds with edge vectors v±, v%, v%. 
We now prove the two claims of Lemma 7.2. 

Proof of (60). Since A; t C Eg, we may assume without loss of generality 
that F C Eg. Since we are assuming (42) to hold on F, we thus see that 



\B\ ~ \5 

for any ball of radius a. 

Let P be any parallelopiped with edge vectors v\, V2, V3. From the hy- 
potheses on vi, V2, V3 and some elementary geometry we see that 

|P| ~ \N MS (P)\, 
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so that P can be efficiently covered by a finitely overlapping collection of M5- 
balls. From a simple covering argument we thus obtain 



uniformly in P, and (60) follows from (65). 

Proof of (61). By perturbing Vi if necessary we may assume that 
Ztt(vi, V2), V3 > 5. The claim then follows from (65) and the geometric ob- 
servation that, if P is any parallelopiped with edge vectors v±, V2, V3, then P 
can be covered by 0(M 2 ) squares Q parallel to tv(vi,V2), and \P\ > M~ 1 \Q\ 
for any one of these squares. □ 

8. Consequences of Lemma 7.2: Planiness and graininess 

In this section we shall utilize the first conclusion Lemma 7.2 to enforce a 
"planiness" property on the tubes in T p ; roughly speaking, we require for all 
xq satisfying the property P2 that most of the tubes in T p (xo) to be coplanar. 
Then we will shift scales in order to also enforce this property on the (5-tubes 
T. Later in this section we will also use the second conclusion of Lemma 7.2 
to also enforce a certain "graininess" property on the set Eg. 

We begin this program with 

Proposition 8.1. Let xo be such that i^C^o) holds. Then one can find a 
family of planes H p = H p (xo) through xq and subcollections T p (xo,ir) C T p (a;o) 
for all 7r 6 ILj(xo) such that 

• For all it £ Ilp(xo) and T p £ T p (xq, vr) we have 



FDP 
\P\ 



(66) 



T P C iV(7T), 



where N(ir) is the 5 p- neighbourhood ofw, and C2 is a constant to 
be chosen later. 

The T p (xo,vr) are disjoint as tt varies, and 



(67) 



#T p (x ,vr) >5 c ^p-^ 

for all tt G U p (xq). 



We have 



(68) 



1 < #n p (x ) < 



We have 



(69) 



yT p € T p (xq) : T p G (J T p (x ,vr) 
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• For every ir G IL p (xo), the set T p (xq,tt) is a subset of 

(70) {T p eT p (x ) :Pi(s ,2»}. 

For the purposes of visualization we recommend thinking of H p as con- 
sisting of a single plane n, and T p (xo,vr) as equal to T p (xo). 

Proof. The idea of the proof is to iterate the following lemma. 

Lemma 8.2. Let xq be such that ^2(^0) holds. Suppose that T' p is a subset 
of (70) such that 

(71) #T' p > 5^#T p (x ). 
Then one can find a plane tt through xq such that 

(72) #{T p G T' p : T p C iV(vr)} > S C ' 2 ^ e p'^ , 
if C2 is chosen sufficiently large. 

Proof. Suppose for contradiction that (72) failed for all planes tt through xq. 
Let F denote the set 

(73) F = {x G E s fl B(x , Cp) : (42) holds for all S < a < 1}. 
Let T p be any element of T' p . From (45), (46), and Lemma 5.2 we have 

(74) V T G T[T p ],x G T n B(xq, Cp) : x G F, [T„] (re) < S'^p-^. 
T, x 

From (31) we thus obtain 

^Fn{ W [T p ]<«5-v^p-(3-d)} Jb(xo,C P ) 

By (27) and (11) we see that 

where A(xo,T p ) was defined in (57). We rearrange this as 

/ XA( X0 ,T P ) £ 5^~ d p d . 

Summing this over all T p G T' p and using (71) we obtain 

(75) / E XA(, ,T p )^^#TpWrV]. 

F T p £T' p 
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On the other hand, we will shortly prove the estimate 
(76) 

EEE \A(x ,Ti)nA(x ,T;)nA(x ,T%)nF\ < S- c ^S cC ^S 3 - d p d #T p (x ) 3 , 

•-pi T""2 7^3 

p p p 

where the tubes T l p will always be assumed to range over T' and c denotes 
a constant independent of C 2 and e but which can vary from line to line. 
Rewriting the left-hand side of (76) as 



3 

p) 



T p &T p 



L3(F) 

and using (8) (with d replaced by 3/2) and (51), we thus obtain 



J2 XA( 



T p &T p 



< 5- c ^5 cC2 ^5 3 - d p d #T p (x ) 

Li(F) ~ 



which contradicts (75) if C 2 is chosen sufficiently large. 

It remains to prove (76). Fix T p \T p 2 ,T p 3 , and choose Vi so that (59) holds. 
We divide the contributions to (76) into three cases. 

First, we consider the contribution when v\ and v 2 are close together, or 
more precisely when 

(77) Z Vl ,v 2 < 5 C2 ^ 2 . 

Estimating the summand crudely by (51), we may majorize the contribu- 
tion of this case by 

(78) 6- c ^8 3 - d p d #{{T},rf,Tf) e T' p 3 : (77) holds}. 
However, by (52) we see that for any fixed T p there are at most 

< d -CVe^C 2 V~e/2y p -(3-d) 

tubes T 2 for which (77) holds; by (49) this is majorized by 



<5- c ^5 cC2 ^#T p (x ). 



Thus the contribution of this case to (76) is acceptable. 

Next we estimate the contribution for which (77) fails, but for which 

(79) Ztt{v 1 ,v 2 ),v 3 < 6- C2 ^~ e p. 

where n(v±,v 2 ) is the plane spanned by v\, v 2 . By crudely estimating the 
summand by (51) again, we see that the contribution to (76) is at most 

r C^3-y #{(T l )T 2 )T 3 } G (T p3 . (79) holds} . 
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However, we are assuming (72) to fail for all planes tt. This implies that for 
any fixed T^, T%, there are at most ^^-(s-d) tu b es T 3 for which (79) holds. 
By repeating the previous calculation we see that the contribution of this case 
to (76) is also acceptable. 

Finally, we consider the contribution for which (77) and (79) both fail. In 
this case the first part Lemma 7.3 applies with M = <5 _c ' 2 ^/ 2 . Using (60) we 
therefore see that this contribution is also acceptable. Combining the three 
contributions together we obtain the desired estimate. □ 

With this lemma it is easy to prove the proposition by the following con- 
struction. Initialize U p (xo) to be the empty set and T' to be (70). From (50) 
we see that (71) currently holds. 

We now perform the following iteration. We use Lemma 8.2 to extract a 
plane tt which obeys (72) with respect to T' . We add this plane tt to H p (xo) 
and define 

T p (x ,7T) = {T p eT' p :T p cN(7T)}. 

We now remove all the elements of T p (xq, it) from and repeat the iteration 
until (71) fails, at which point we halt the construction. 

It is easy to verify that this iteration gives all the properties claimed in 
the proposition. The bound (69) follows by construction. The upper bound on 
#Hp follows from (72) and (49), while the lower bound on #U p follows from 
(69). □ 

Henceforth we assume that the collections H p (xq) and T p (xq, tt) have been 
chosen for all xq such that P2(xq) held. 

Having obtained planiness, we now turn to the companion property of 
graininess. We remark that the heuristic that one should be able to reduce 
the study of Besicovitch sets to the grainy case is due to Tom Wolff (private 
communication). Much of the work in this paper was motivated by a desire to 
make this heuristic rigorous. 

Definition 8.3. Let tt be a plane, and xo be a point in M 3 . A set S is 
said to be grainy with respect to (xo,tt) if S C B(xo,Cp) and 

(80) S = (J Q 

for some collection Q of squares Q parallel to tt such that 

(81) #q<5- c ^~ e 5 (2 - d)/2 . 

If 5 is a grainy set, we define its dilate CS to be 

CS= |J CQ 

where CQ is the dilate of Q by C around its center. 



416 



NETS HAWK KATZ, IZABELLA LABA, AND TERENCE TAO 



Note if S is grainy, then CS is also grainy with slightly larger constants. 
Also, from (81) we have 

(82) \S\,\CS\ < 5p 2 #Q < S- c ^5 3 - d p d . 

Proposition 8.4. Let the hypotheses and notation be as in Proposition 8.1. 
Then for each ir G H p (xo) there exists a set S(xo,ir) which is grainy with respect 
to (xq,tt), such that 

(83) V T p G T p (x ,ir),T G T[T p ],x G Tf~)B(x ,Cp) : x G S(x ,ir). 
Tp,T, x 

Proof. We will modify the argument in Proposition 8.1. 

Fix xo, 7r. Cover B(xQ,Cp) by a finitely overlapping collection of squares 
Q parallel to ir. Define Q(xo,vr) to be the subcollection of these squares Q for 
which 

(84) > S°>* 

where C3 > is a constant we will choose later. Define S(xo, ir) by 

S(x ,tt) = |J Q. 

Q£Q(a;o,7r) 

From (84), (51) and the estimate \Q\ ~ 5 2 we obtain (81), and so S(xo,ir) is 
grainy with respect to (xo,ir). 
Define the set X by 

X = (E s nB{x ,Cp))\S{x ,ir). 

Then by construction 

(85) sup ]££GI<*<Wi 

Q \Q\ 

where Q ranges over all squares parallel to ir. 

To finish the proof we must show (83). By summing (46) for all T p G 
T p (xQ,ir) we have 

V T p G Tp(x ,ir),T £ T[T p ],x G T n B(x , C p) : (47) holds, 
-^p > x 

Thus by Lemma 5.2 it suffices to show that 



VT p G T p (xo,ir),T G T[T p ],x G TnB(x ,Cp) : ((47) =^^I). 
^p 

We rewrite this as 



(86) / E H8[T P ]<5^ f £ W [T p ]. 
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Figure 5. A (simplified) typical intersection of E§ with a p-cube. 
In this case the plane tt is oriented in the xz direction. 



We majorize the left-hand side of (86) as 



XA(x ,T p )- 



The right-hand side of (86) can be estimated using (67) and (27). Thus (86) 
reduces to showing that 

T p eT p (x ,n) 

for some sufficiently large C. This simplifies by (49) to 

/ E XA(*o,T„) £ 5 c ^5 3 ~ d p d #T p (x ). 

T p £T p (x ,n) 

We will shortly prove 
(87) 



E E E M*o, T}) n A(x , T 2 p ) n A(x , T p 3 ) n X\ 
< s- c ^S cC3 ^ e 5 3 - d p d #T p {x ) 3 , 



Tl T 2 T3 

P P P 



where the T p are assumed to range over T p (xo,tt); the desired estimate then 
follows as before from (8) with d replaced by 3/2 and then (51), if C3 is chosen 
sufficiently large. 

As in the proof of Proposition 8.1, the contributions to (87) when 

(88) Z Vl ,v 2 £8 C3 ^~ e 
or when 

(89) Ztt(v 1 ,v 2 ),v 3 >S- 2C3 ^p. 
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are acceptable if C3 is sufficiently large. Thus it only remains to consider the 
contribution for which (88) and (89) both fail. However, in this case the second 
part of Lemma 7.2 applies. From (61) and (85) we have 

\A(x , T l p ) n A(x , T 2 p ) n A(x , T 3 ) n X\ < 5- c ^5 3 ~ d p d 5 cC3 ^. 

Thus the contribution from this case to (87) are also acceptable, as desired. □ 

Henceforth we assume that the sets S(xq,tt) have been chosen for all xo 
such that P2(xo) holds, and all ir G IL p (xo). The constants C2 and C3 are now 
fixed, and future constants C may implicitly depend on these two. 

9. Consistency of planes and squares 

In Proposition 6.6 we showed that any sticky collection of tubes satisfying 
(20) and (21) had to satisfy the property P2(x) extremely often. This property 
in turn implied two strong properties on the collection, namely planiness and 
graininess, by Propositions 8.1 and 8.4. 

The planiness property is really a property about the p-tubes T p rather 
than the (5-tubes T. By taking advantage of the double-sticky assumption and 
changing scale from 5 to S 2 , we shall be able to also impose a planiness property 
in the 5-tubes. More precisely, we show 

Definition 9.1. Define Pz{x) to be property that one can find a family 
of planes Hs(x) through x and subcollections T(x,ir) C T(x) for all tt G 11,5 (x) 
such that: 

• For all 7r G n^(x) and T G T(x, it) we have 

(90) T C N s - C * s {*). 

• The T(x, 7r) are disjoint as tt varies, and #T(x, ?r) > 5 c ^5 d - 3 for all 
vr G U s (x). 

• We have 

(91) 1 < #n 5 (x) < 5~ c ^. 

• We have 

(92) VT G T(x) : T G T(x, vr) for some vr G U 5 (x). 

• We have 

(93) #{T G T(x ) : dir(T) G B(co, 6)} < r c ^ c (5^ 3 
for all directions u> and all 5 < 9 <C 1. 
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Proposition 9.2. Suppose that there is an X-ray estimate at dimension 
d, and that there exists a Besicovitch set E with dim(£ l ) < d + e. Then there 
exists a sticky collection T of tubes such that (20) and (21) hold, and such that 

(94) V T G T,x G T : P 3 (x). 

T, x 

Proof. By Proposition 3.3 with 5 replaced by 5 2 , we can find a doubly 
sticky collection ¥,52 of tubes with parent collection T5 and grandparent col- 
lection T p which obey the analogues of (20), (21), and (22) with 5 replaced by 
5 2 . By Proposition 6.6 the property P2 (with 5 replaced by 5 2 ) holds almost 
always, and the conclusions of Proposition 8.1 obtain whenever P2 is satisfied. 
Similarly the property (52) with 5 replaced by 5 2 also holds almost always. 
By comparing these statements against Definition 9.1 we see that the desired 
properties obtain with T = T$. □ 

Henceforth we fix T to be a collection of tubes satisfying all the conclusions 
of Proposition 9.2, and assume that we have chosen the collections 11,5 (x) an d 
T(x,7r) for all x such that Pz(x) holds, and all tt G H§(x). This changing 
of scale from 5 to p is intriguingly reminiscent of the techniques employed in 
proving restriction theorems (see e.g. [1], [2], [3], [16], [14]). 

Since T is sticky and satisfies (20) and (21), we may apply Proposition 6.6 
again and obtain (53). In particular, the conclusions of Propositions 8.1 and 
8.4 obtain for a large number of xo- 

We thus have two scales of structure on our set, one at scale 5 arising from 
P3, and one arising from the propositions in the previous section. Fortunately, 
there is a close relationship between the two structures, namely that the planes 
in Tls(x) and the planes in Ilp(xo) are mostly parallel if x G B{xq,C p). In 
particular, the planes in Tls(x) are mostly parallel to the squares Q that contain 
x. More precisely: 

Definition 9.3. We define ^4(^0, x, T p , T) to be the property that ^2(^0) 
and Pz{x) hold, that x G B(xo,Cp), and there exist planes tt 6 Us (a;), tt' G 
n p (x ) such that T G T(x,vr) D T[T„], T p G T p (x ,vr'), x G S(x ,tt'), and we 
have the consistency condition 

(95) Ztt,it' <8- c ^~ e p. 



Proposition 9.4. // the constants in the above definition are chosen 
appropriately, then we have 
(96) 

V T p eT p ,x eT p ,T eT[T p },x eTDB(x ,Cp) : P 4 (x ,x,T p ,T). 

1 pi 2-0) J- > 3? 
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Figure 6. A typical situation in which P4,(xq, x, T p , T) holds. The 
tubes through x lie on the plane tt, which is essentially parallel 
to the square Q C S(xo,tt') which contains x. The plane tt' (not 
pictured) differs in angle from tt by about p. 

Unfortunately, the proof of this proposition is the most technical part 
of the paper, and mostly consists of the combinatorial reshuffling machinery 
developed in Section 5. Heuristically, the idea is as follows. If the planes in 
ILj(x) are at a large angle to the planes in ILj(xo), then any tubes T which 
are parallel to both tubes will be constrained to a narrow angular band. By 
(52) this would imply that very few tubes are parallel to both kinds of planes, 
which will eventually contradict (66), (50) and (90), (69). 

Proof. We need to show that V Q : P4, where Q = Q(xq,x,T p ,T) 
denotes the property that T p ,xq,T,x 

T p G T p , x £T p ,TG T[T p ],x 6 Tfl B(x , Cp). 

Apart from the condition (95), this can be achieved simply by applying 
the machinery of Section 5 to the previous propositions, as follows. 
From (69) and Lemma 5.5 we have 

V ' P 2 (x ),T p eT p (x ):T p e \J T p (x ,ir'). 

Clearly this implies 

V P 2 (x ), T p G T p (x ) : [ V T € T[T p ],x £ T : T p G |J T p (x , tt') 
x o> t p T ' x 7r'en p (x ) 
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By Lemma 5.10 we therefore see that 

(97) V P 2 (x ),Q:T p e (J T p (x ,tt'). 
xo,x,l p ,l 7r'en p (xo) 

On the other hand, from (53) and another application of Lemma 5.6 we 

have 

V Q:P 2 (x ). 

XQi X, 1 p, 1 

Combining this with (97) using Lemma 5.4 we obtain 

(98) V Q:P 2 (xo),T p G (J T p (x ,n'). 
xo,x,l p ,l n 'en p (x ) 

Now from (83) and Lemma 5.5 we have 

, V P 2 (x ),ir' G n p (x ),T p G T p (x ,tt'), 
X ,7T ,T p ,T,x 

T G T[T p ],x G TnB(x ,Cp) : x G S(x ,tt'), 
which we may rearrange using Corollary 5.7 as 

V Q, P 2 { x o), T p G |J T p (x ,tt') : x G S(x ,7r') 
xo,x,lp,l K'en p (x ) 

with the understanding that ir' is the unique plane in U p (xq) such that T p G 
T p (xo,vr'). Combining this with (98) via Lemma 5.4, we obtain 

(99) V Q : R 

XQi x^ T p , T 

where R = R(xo, x,T p ,T) denotes the property that P 2 (xo) holds, and for 
some 7r' G IIp(xo) we have T p G T p (xo, n') and x G S(xo, it'). 
From (92) and Lemma 5.5 we have 

V i P 3 (x),TeT(x) :TG (J T(x,tt). 
x ' 1 Trends) 

We rearrange this using Corollary 5.7 as 

V T p G T p , T G T[T p ], x G T, Pz(x) : T G |J T(x,tt). 

T P' T i X n<=Yl s (x) 

By (94) and Lemma 5.4 we thus have 

V T p G T p , T G T[Tp], x G T : i?', 
J p, i , x 

where i?' = R'(x,T) denotes the property that 

P 3 (x), andTG |J T(x,vr). 
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By Lemma 5.10 we then obtain 

(100) V Q : R'. 

The property P^xo, x, T p , T) is equivalent to 

R, R', (95) 

where C4 is a constant to be chosen later, and we adopt the convention that ir 
is the unique plane in ILj(x) such that T G T(x,7r). Thus by (99), (100), and 
Corollary 5.3 it only remains to show that 

V Q (95)), 

or in other words that 
(101) 

|{(x ,x,T p ,T) iQ^^'^^Tr'^r^pjl < J c ^|{(xo,x,T p ,T) :Q}|. 
The left-hand side of (101) is majorized by 

(102)\{{xo,x,T p ,T,ir,ir') : P 2 {x ),P 3 {x),it' G U p (x ),ir e U s (x),\x - x \ < p, 
T p G T p (x ,tt'),T G T[T p ] nT(x,7r),Z7r,7r' > r C4 ^p}|. 

Of course, T p , T, 7r, 7r' are measured by counting measure and xq, x by Lebesgue 
measure. 

Fix xq, x, 7T, 7r' such that 

P 2 (xo), J P3(^),7r / G n p (x ),7r G U s {x),Ztt,tt' > b~ C ^p, 

and suppose that T p and T be such that T p G T p (xo, ir') and T G T[T p ]nT(x, 7r). 
From (66) and (90) we see that T is contained in the (5 _c,v ^/9-neighbourhood 
of ir' , and the ^^^ct-neighbourhood of ir. From the angular separation as- 
sumption on ir and ir' and elementary geometry this implies that the direction 
of T is constrained to a cap of diameter 0{5~ c ^5 Ci ^), whose center depends 
only on ir and ir' . From (93) we thus see that the contribution of (xo, x, it, it') 
to (102) is majorized by 

§- C ^- C^/I ^Ca y/e^ _ 

We may therefore majorize (102) by 

E E r^r^ 74 ^)^- 3 dxdx . 

p 2 (zo),p 3 (x),\x-x \<cp W / G n p (x ) Tren,^) 
From (68) and (91) this is majorized by 

5 -C^e 5 cC^e 5 d-Z [If dx) dx . 

•MOo) \JE s nB(x ,Cp) J 
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By (51), the inner integral is ;$ 5~ c v^5 3 ~ d p d . From (21) we can therefore 
majorize this as 

(103) 5 -c^ 5 cc A ^ 5 yi. 

We now turn our attention to the right-hand side of (101), which we can 
rewrite as 

^ II EE Xt p (x )xt(x) dxdx 

J J \ x - x o\< c P T p £jpT£T[T p ] 

for some c > 0. Evaluating the x-integral using (27), we see that this is 

«5 C ^/ E XT p (x )5 3 / 2 dx , 
T p eT p 



which by (25) is 

« <5 C ^<5 3 / 2 . 

Comparing this with (103) we obtain (101) as desired, if C4 is chosen sufficiently 
large. □ 

Applying Lemma 5.10 to this proposition, we obtain the main result of 
this section, which is phrased in terms of a rather complicated property P5. 

Corollary 9.5. If P^(xo,T p ) denotes the property that 

(104) V T g T[T p ],x £TnB(x ,Cp) : P 4 (x ,x,T), 
T, x 

then one has 

(105) V T p G T p , x eT p : P 5 {x ,T p ). 
T p ,x 



10. An arithmetic progression of length three 

In the last few sections we have been steadily accumulating structural 
information on a collection T of tubes. This program has culminated in Corol- 
lary 9.5, in which we have shown that a rather complicated property P5 holds 
almost always. 

We have taken great pains to ensure that this property P5 almost never 
failed. We will now see the payoff for this carefulness, in that we will be easily 
able to find three widely spaced points in arithmetic progression for which P5 
holds. Furthermore we shall be able to impose an additional technical condi- 
tion, namely that the planes associated to these points are not too parallel. 
More precisely: 
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Proposition 10.1. Let T be a sticky collection of tubes satisfying (105). 
Then there exists a tube T p G T p and three points x$,xq + s,xq + 2s in T p 
such that P§{xq + si,T p ) holds for i = 0,1,2. Furthermore, \s\ ~ 1 and 
Ztt(xq + si),ir(xQ + sj) 2; S c& ^ for distinct i,j £ {0,1,2}, where ir(x) de- 
notes the plane for which T p £ T p (x,tt(x)), and C$ > is a suitable constant. 

Note that the plane tt(x) is well defined if P$(x, T p ) holds. 
Proof. From (105) and Lemma 5.9 we have 

VT p eT p :$x£T p :P 5 (x,T p )}. 

Thus by (31) we can find a tube T p 6 T p such that 

(106) VxGT p :P 5 (x,T p ). 
Fix such a tube T p , and let A denote the set 

A = {(x ,s) £l 3 xR 3 : \s\ ~ 1, xo + sie T p for i = 0,1,2} . 

Note that xq lives in T p , and s lives in a translate of CT p . We have to show 
that the set 

{(x ,s) G ^ : P 5 (x + si,T p ), Zir(x + si),7r(x + sj) > b c ^ 

for all distinct i,j = 0, 1, 2} 
is nonempty. On the other hand, from (106) we have 

\{(x ,s) £ A : P 5 (x + si,T p ) fails}) ^ S\{x £ T p : P 5 (x,T p ) fails}) < 5 C ^5 2 
for alH = 0, 1, 2. Since \A\ ~ 5 2 , it will therefore suffice to show that 

\{(x ,s) e A : P 5 (xo+si,T p ),P 5 (xQ+sj,T p ),Z7r(x Q +si),n(x +sj) < 5 c ^ e }\ < 6 C ^S 2 

for all distinct i,j = 0, 1, 2. Making the change of variables x = xq + si and 
y = xq + sj, it therefore suffices to show that 

(107) \{{x,y) £ F x F : Z(n(x), ir(y)) < 5 c ^ e }\ < 5 c ^~ e 5 2 
for some c > 0, where F denotes the set 

F = {x£T p :P 5 (x,T p )}. 

The idea is as follows. By using the Kakeya estimate (14) one can easily 
recover the trivial estimate for (107) with c = 0. To improve upon this we use 
the X-ray estimate (15) instead, observing that if the squares corresponding to 
different x are almost parallel then there will be many parallel tubes in each 
direction. 
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We now turn to the details. Let II be a maximal 5 Ch ^-angular-separated 
set of planes containing the axis of T p . If x G F, then there exists a n G II 
such that Z(tt(x),tt) < 5 c ^ e . We can therefore majorize the left-hand side of 

(107) by 

(107) < ]T \{(x,y) GFxF:Z(vr(x),7r),Z(7r(y),7r) < S C ^} 
wen 

which we rewrite as 

(108) (107) < \ { x G F '■ A*(*M «^ C5V ^}| 2 - 

wen 

On the other hand, we clearly have 



< \F\ < S. 



\ {* e F ■ A*(*)>*) £ 8 C5V ~ £ } 

vren 

It thus suffices to show 

(109) |{x G F : Z(7r(x),vr) < <5 C5 v^J| < S^^S'^S 

uniformly in it, since (108) clearly follows from the above two statements, for 
a sufficiently large choice of C5. 

We now prove (109). Fix it, and let F 1 denote the set 

F' = \x G F : Z(^(x),vr) < <5 C5 ^} . 

Suppose that x was an element of F' . From (104) and the definition of P4 we 
have 

V T G T[T p ],y€ B(x,Cp) : y G S(x,ir(x)). 

y,T 

By (31) and (27) we therefore have 

(110) / ]T X T(y)^«^ 2 . 

JS(x,ir(x)) Tgf^Tp} 

Let / denote the one-dimensional interval in M 3 centered at the origin, parallel 
to 7r, orthogonal to the axis of T p , and having length c) -0 ' 5 ^^, and let V be 
a maximal ^-separated subset of /. If Q is a square in S(x,ir(x)) and T is a 
tube in T[T p ], then (66), the angular separation between ir(x) and tt, and some 
elementary geometry shows that 

\TnQ\<s c ^J2\( T + v ) nC Ql 

where T + v is the translate of T by v, and CQ is a dilate of Q. Therefore 
/ Xr(y) dy<6 c ^J2 I Xt+v{v) dy 
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where CS(x,ir(x)) was defined in Definition 8.3. Combining this with (110) 
we obtain 

/ EE XT +V {y)dy>5- C ^'\ 

JCS(x,tt(x)) v( -y Tg tt[T p ] 

From (82), (9) and some algebra we therefore obtain 

/ (E E XT +V (y)) dy>5 c ^6- d ' c ^- d '. 

JB(x,Cp) \ ve VTeT[T p ] J 

Integrating this over F' we obtain 

[\F'nB(y,Cp)\ (]T E XT + v(y)) dy>5 c ^5- d ' c ^5 3 - d '\F'\. 

Since \F' n B(y, Cp)\ < 5 3/2 , we thus obtain 

d! 



(Ill) 



E E XT+v(y) 

v&V TgT[T p ] 



d' 



On the other hand, the directional multiplicity of the collection {T + v : T G T, 
v £ V} is at most w #y « (5" ar,v/i . Thus by (15), we have 

E E XT + „(y) <^- 1 (r c ^^) 1 -^ 

dSV TeT[T p ] d ' 

for some (3 > 0. Combining the two estimates gives (109) with c = /3d', as 
desired. □ 

The above proposition yielded a single arithmetic progression Xq, x®, x\ 
in a single p-tube T p of length three which had the property P5 . By definition 
of P5, there are many points in B(x®,Cp), i = 0, 1,2, with the property P4. 
We therefore expect a large number (about 5 _1 ) of tubes T in T[T p ] to satisfy 
property P4 when they pass near x®. 

The property P4 is of course very complicated. However, we shall need 
only a few of the components of P4 for the purposes of our argument, namely 
the graininess property, the planiness property at scale 5, and the consistency 
condition (95). More precisely, the proposition we will need for later sections 
is as follows. 

Proposition 10.2. Suppose we have an X-ray estimate at dimension d, 
and there exists a Besicovitch set of Minkowski dimension at most d + e. Then 
for any < 5 <C 1 we can find a p-tube T p , a direction- separated set T[T p ] of 
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5-tubes in T p with cardinality <; 5 c ^5~ l , three points in arithmetic progres- 
sion in T p with spacing ~ 1, and three planes ttq, tv®, ir 2 each containing the 
axis ofTp, such that 

(112) Z7T°,7T°>(^ 

for all i / j. Furthermore, for each i = 0, 1, 2 we can find sets Si which are 
grainy with respect to (x- ) ,7r- ) ), and to each X{ G Si we can associate a plane 
TTi(xi) containing Xi, such that 

(113) Zin^),^ <5~ c ^p 
holds for all i and Xi G Si, and 

(114) f #{T G T[T p ] n T(x ) n T(x 2 ) : T G N(tt (x )) n ^(vr 2 (x 2 ))} dx dx 2 

> 8 C ^5\ 

where 
and 

X = |(a; ,iC2) e S x S 2 : ^(x + x 2 ) G Si 



Note that this proposition differs from the previous ones, in that the V 
symbol is completely absent. That stringent notation was necessary only in 
order to obtain the arithmetic progression in Proposition 10.1. Henceforth we 
shall be content with properties which are satisfiedS only about 5 Cy ^ of the 
time. 

This proposition is not so much a statement on the collection T, but rather 
on the collection T[T p ], which can be regarded as a rescaled Besicovitch set. 
We are restricting ourselves to this portion of T in order to take full advantage 
of graininess, and will no longer concern ourselves with the remainder of T. 

Proof. Fix Tp, xq, s so that the conclusions of Proposition 10.1 hold. Set 
x® equal to xq + si and Si equal to the set CS(x ( -,tt(x ( -)). Set ir® to be any 
plane within an angle of 5~ c ^p to vr(x^) which contains the axis of T p ; such a 
plane exists from (66). Note that since Si is grainy with respect to (x^,7r(xf)), 
it is also grainy with respect to (x°,7r?) with slightly worse constants. 

Let i = 0, 1, 2. From the definition of P§(x®, T p ) we have 

V T G T[T p ], Xi G THBixlCp) : P 4 (x°, x u T, T p ). 

1 , Xi 



See however Proposition 14.5 in the Appendix. 
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By Lemma 5.6 we therefore have 

m V T€T[T p ],Xj G Tn B{x% Cp) for j = 0, 1, 2 : P 4 (x°, x,, T,T P ). 
T,x ,xi,x 2 

From Lemma 5.2 we thus have 

V T G T[T p ], sj G T n Cp) for j = 0, 1, 2 : P 4 (x°, a*, T, T p ) 

T, x ,xi,x 2 

holds for all i. Applying (31), we obtain 

3 3 

E ni{^GTn J B(^,C / 9):P 4 (^,x l ,T,T p )}| « E iTnBfe , Cp)|. 

TeT[T p ] i=l TeT[T p ] i=l 

Computing the right-hand side using (28) and (19) we thus obtain 

3 

( 115 ) E \{\{x^T^B(xtC P ):P^xtx l ,T,T p )}\>8 1 ^ 2 . 

TeT[T p ] i=l 

For i = 0, 2, define the sets 

= {x* G 5i : P 3 (x*)}. 

For any xq, x 2 , T, the contribution to the above sum is 0(<5 5 / 2 ). Furthermore, 
from elementary geometry and the definition of P 4 and Si, we see that this 
contribution vanishes unless Xo £ T H Fo, x 2 G TnF 2 , and (xo,x 2 ) G X. Thus 
we have 

E Kfco.afe) G Xn((TnF )x(rnF 2 )) : P 4 (x°,x t ,T,T p ) holds for i = 0,2}| > <5 4 . 

TeT[T p ] 

We may rewrite the above as 
(116) 

/ #{T G T[T P ] : P 4 (x°, x , T, T p ), P^x", x 2 , T, T p )} dx dx 2 > ,5 4 . 

JXn(F xF 2 ) 

From the definition of P4, we have 

#{T G T[T p ] : P 4 (x°,Xi,T,T p ) holds for i = 0,2} 

« E E #{TGT[T p ]nT(x )nT(x 2 ):TGiV(7ro)niV(7r 2 )}, 

T () enO(a;o) 7r 2 en|(x 2 ) 

where 

nj(xi) = {vr G n 5 (x 4 ) : Z^ < <T C ^p}. 

It therefore follows that we may find a way of assigning to each Xj G Pj, i = 0, 2 
a plane 7Tj(xj) G n^(xj) such that 

(117) / #{T G T[T p ] n T(xo) n T(x 2 ) : 

Jxn(F xF 2 ) 

T G N(7T (x )) n iV(^ 2 (x 2 ))} dx dx 2 > 6 CV ~ e S 4 



MINKOWSKI DIMENSION OF BESICOVITCH SETS IN K 3 



429 



for some large C. For, if (117) failed for all choices of assignments 7To(xo) 
and 7T2(x2), then we could average over all such assignments using (91) and 
contradict (116) if C was large enough. 

We thus choose planes 7Tj(xj) £ Yl\{xi) so that (117) holds; from (95) we 
have (113) for all x« 6 Fj. We then extend 7Tj() to all of Si so that (113) holds 
for all Xi 6 S{. It is then an easy matter to check all the properties in the 
proposition. □ 




Figure 7. The set Eg[T p ], and its (grainy) slices at the planes z = zq, 

z = Z2, z = (zo + Z2)/2. The direction-separated assumption on 
T[T p ] means that the vectors (xo — X2,y — y 2 ) must essentially fill 

out a p x p square. 

11. Reduction to an additive setting 

The purpose of this section is to convert the conclusions of Proposition 
10.2 into pure additive combinatorics via some affine transformations and a 
discretization. This follows the philosophy of Bourgain [4], in which the prop- 
erties of the Besicovitch set were converted into statements about sums and 
differences of sets; our main innovation is that we shall also encode the plani- 
ness and graininess properties into this additive combinatorial setting. After 
some technical preparations, the desired contradiction will then follow from 
the lemmas in [4]. 

For reasons that shall become clear, we shall parametrize the integer lattice 
^ 4 by (x ,y ,X2,y 2 ). 
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Proposition 11.1. Suppose there exists a Besicovitch set of Minkowski 
dimension at most d+e for some 0<£<1, and let M S> 1 be a large number. 
Then there exists a subset SlcZ 4 with cardinality 

(118) #ft > M~ cV ~ e M 2 

such that 

• (Graininess) . If we define the projections Xq,Yq, X 2 ,Y 2 ^TL of Q by 



X = 


{xo 


(xo,y 0) x 2 ,y 2 ) G ft for some y ,x 2 ,y 2 }, 


Yq = 


{yo 


(xo,y 0) x 2 ,y 2 ) G ft for some x ,x 2 ,y 2 }, 


x 2 = 


{x 2 


(xo,y ,x 2 ,y 2 ) G ft for some x ,y ,y 2 }, 


Y 2 = 


{Y2 


(xo,y 0) x 2 ,y 2 ) G ft for some x ,y ,x 2 }, 



then we have 

(119) #X < M C ^M, 

(120) #y < M c ^M d - 2 , 

(121) #X 2 < M c ^M d - 2 , 

(122) #y 2 < M C ^M. 
Furthermore, we have 

(123) #{(x 2 + y ) + (x + y 2 ) : (x , y , x 2 , y 2 ) G ft} < M c ^M d ~ 2 . 

• {Planiness). There exist functions g : Xq x Yq — ► Y 2 and /i : X 2 x Y 2 — ► 

(124) y 2 = ff(x ,y ) 
and 

(125) x = /i(x 2 ,y 2 ) 
/or all (x ,y ,x 2 ,y 2 ) G ft. 

• {Direction- separated tubes). We have 

(126) #{(x , y , x 2 , y 2 ) G ft : (x 2 + y ) - (x + y 2 ) = w} < M C ^M 
for all weZ. 

One should view (126) and (123) as a statement about the difference set 
and sum set of X 2 + Yq and Xq + Y 2 . The conflict between these two estimates 
will eventually cause a contradiction if e is sufficiently small. 
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Proof. Fix M, and define 5 = M~ 2 , so that M = S' 1 / 2 . Note that it 
suffices to prove the above estimates with < replaced by 

For technical reasons we shall prove the proposition with the lattice Z 
replaced by 5Z, and Z 4 by ((5Z) 4 . Since the two lattices are isomorphic, this 
will not make any difference. 

By the results of the previous section, we can find T p , T(T p ), x®, 7if , Si, 
and 7Tj() which obey the conclusions of Proposition 10.2. 

We now apply some affine transformations to place these objects into a 
standardized form. Firstly we may apply an affine transformation (preserving 
the z coordinate) so that T p is the vertical tube 

T p = {(x,y,z) : |z| < 1, |x|, |y| < p}. 

By construction and (112), the planes 7r^ all contain the z-axis, and make 
an angle of gl b c ^~ e with respect to each other. 

By a rotation around the z-axis we may make 7r[j the xz-plane: 

(127) 7T° = {(x,y,z):y = 0}. 

By (112), we may apply a linear transformation (x, y, z) ^ (x + ky, y, z) for 
some \k\ < 5~ c ^ and make ir® the yz-plane without affecting 7r[j: 

(128) 7r 2 ° = {(x,y,z) :x = 0}. 

This transformation may distort the tube T p and the tubes T in T(T p ) slightly 
by a factor of 0(5~ Cv ^), but this will not significantly affect the argument 
which follows. 

Using (112) again, we see that by applying the linear transformation 
(x, y, z) i ^ (k'x,k"y,z) for some 5 C ^ < \k'\, \k"\ < 5~ c ^ we may make tt® 
equal to the x + y = plane without affecting 7Tq or ■ 

(129) 7r? = {(x,y,z) :x + y = 0}. 

Again this may distort the tubes T p and T(T p ) slightly, but this will not affect 
the argument. 

For any xo,X2 G M 3 let ¥(^0,^2) denote the set 

T(x , x 2 ) = {T G T[T P ] n T(x Q ) n T(x 2 ) : T C Ar(7r (x )) n N(tt 2 (x 2 ))}. 
From (114) we have 

dz dz 2 > 5 C ^5 A , 

where x« = (xj, y i , Zj) for i = 0, 2 and 

X(z ,z 2 ) = {(x ,y ,x 2 ,y 2 ) : ((x ,y ,z ), (x 2 ,y 2 ,z 2 )) G X}. 



II 



X(Z ,Z 2 ) 



#T(x , x 2 )dxody dx 2 dy 2 
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The quantities zq, z 2 both range inside intervals of length O(p). Thus one can 
find zo,Z2 £ M such that 

/ #T(zo, x 2 )dx dy dx 2 dy 2 > 5 C ^5 3 . 

Jx(z ,z 2 ) 

Fix zq, z 2 so that the above estimate holds. By factoring the (xo,y ) and 
(x2,y 2 ) planes into translates of the lattice (5Z,) 2 , we see that there must exist 
some translates Lq and L 2 of the lattice (5Z) 2 such that 

J2 #T(x ,x 2 )>5- c ^8-\ 
(x ,y ,X2,y 2 )GX(Zo,z 2 )n(LoxL2) 

By a very mild affine transformation we may make L$ = L 2 = (5'L) 2 . 

Define Q to be the set of all quadruplets (xQ,y Q ,x 2 ,y 2 ) which give a 
nonzero contribution to the above sum. From the direction-separated nature 
of T[T p ], each such quadruplet contributes at most < 8~ c ^ to the sum, and 
so we see that Q satisfies (118). 

Since (xo,y ,zo) lies in So, and xo, y live in <5Z, the bounds (119) and 
(120) follow from the graininess of So and (127). Similarly the bounds (121) 
and (122) follow from the graininess of S 2 and (128). 

Also, since (^+2k , lies in S , the bound (123) follows from 

(129) and the graininess of S±. 




Jt (x„,y ,z ) 




\<-*o-y - z o> 



(x ,y ) 



Figure 8. Since the plane 7To(xo, y , zo) must be close to ttq, its intersection 
with {z = z 2 } is essentially a horizontal line, which explains (124). 
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From the direction-separated nature of the tubes in T[T p ] we see that 
#{(x ,y ,x 2 ,y 2 ) G SI : (x 2 - x , y 2 - y ) = (x,y)} < M c ^ 

for all x, y. The estimate (126) then follows from this and the fact that x 2 — xo 
ranges over a 5-separated subset of a 0((5 _c,v/ ^5 1//2 )-interval, which therefore 
has cardinality 0(M c ^M). 

Finally, we turn to (125) and (124). Fix (x , y , x 2 , y 2 ) G SI, and let T be 
a tube in T(xo,x 2 ), where Xi = (xj,y i; Zj). Then by definition of T(xo,x 2 ) we 
see that T is contained in a 5' -neighbourhood of the plane 7To(xo, y , zo). 
From (113), (127), and elementary geometry we therefore see that y 2 is con- 
tained in a set of cardinality 0(M C ^) which is determined only by 7To(xo, y , zo) 
(so it is independent of x 2 or y 2 ). By the pigeonhole principle we may therefore 
refine SI so that (124) holds for some function g, without affecting (118) (other 
than a worsening of constants). Similarly we can refine SI further to obtain 
(125) for some function h. □ 



12. Sums, differences, and functional relationships 

In the previous arguments, the parameter d was not fixed to have any 
particular value. However, the arguments we shall now use work for d = 5/2 
only, and we shall fix this value accordingly. Of course, this value of d is 
admissible by Theorem 2.3. 




Figure 9. The shape of (130) which is dictated by the all the properties 
of Proposition 11.1, except for (126), with which it is incompatible. 

Having obtained Proposition 11.1, we shall now use pure combinatorics 
and the lemmas in [4] to show that the conclusions of this proposition lead to 
a contradiction if e is sufficiently small. 

The idea is to show that (126) is incompatible with the rest of the prop- 
erties in Proposition 11.1. In fact, those properties have a strong tendency to 
make the set 



(130) 



{(x 2 -x ,y 2 -y ) : (x , y , x 2 , y 2 ) G SI} 
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resemble the set depicted in Figure 9; this is due to the heuristic implication 
of (5) from (4), combined with the functional relationships (125) and (124). 
Of course, such a set is highly incompatible with (126), and so we can attain 
the desired contradiction. 

The property of planiness has evolved into the functional relationships 
(125) and (124), which state that xo is independent of y and y 2 is independent 
of X2 . It turns out that we can strengthen these relationships further in the case 
d = 5/2, and conclude that y 2 is also independent of y , and xo is independent 
of X2. In other words, there is a direct functional relationship between y 2 and 

Xo. 

More precisely: 

Proposition 12.1. Let the notation be as in Proposition 11.1, and sup- 
pose that d = 5/2. Then there exists a subset Q.\ of Q with cardinality 

(131) #«i > M~ C ^M 2 

and functions g : Xq — > Y 2 and h : Y 2 — > Xo such that 

(132) y 2 = s(x ) 
and 

(133) x = h(y 2 ) 

for all (xo, y , x 2 , y 2 ) G Q\. In particular, we have 

(134) #{(x 2 + y ) + /(y 2 ) : (x ,y ,x 2 ,y 2 ) G ^} < M^M 1 ' 2 
and 

(135) #{(x ,y ,x 2 ,y 2 ) G Q : (x 2 + y ) - /(y 2 ) = w} < M C ^M 
for all w G Z, where f : F 2 Z is the function /(y 2 ) = y 2 + h(y 2 )- 

Proof. The argument will be a heavily disguised variant of the "hairbrush" 
constructions in [17]. 

For each (xo,y ) G Xo x Yq, define the multiplicity 

W)(x ,y ) = #{( x 2>y2) : ( x o,y > x 2,y2) e 

From (121) and (124) we have 

Moo <M C ^M 1 ' 2 ; 

from (118) we have 

Mi >M~ C ^M 2 ; 
and finally, from (119) and (120) we have 

#supp(/i ) < M C ^M 3 / 2 . 
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Combining these facts by Lemma 1.2, we may therefore find a subset S C 
Xq x Yq with cardinality 

(136) #S > M~ C ^M 3 / 2 
such that 

(137) Mo(x ,y ) > M~ C ^M 1 / 2 for all (x ,y ) G 5. 
In particular, if we define C to be the set 

{(xo,y > x 2,y2) G n '■ ( x oiyo) e S}, 

then we have 

(138) #ST > M~ C ^M 2 . 

Now for each (x2,y 2 ) G X 2 x F 2 , define the multiplicity 
^2(x 2 ,y 2 ) = #{(xo,y ) : (xo,y ,x 2 ,y 2 ) e $7'}. 
From (138) we have 

^^/x 2 (x 2 ,y 2 )>M-^M 2 . 
x 2 y 2 

From (121), we can therefore find a number x 2 G X 2 such that 

E^( x 2,y 2 )>M^ M 3/ 2 . 

y 2 

Fix this x£. From (120) and (125) we have 

sup/z 2 (x§,y 2 ) < M^M 1 / 2 , 

y 2 

finally, from (122) we have 

#supp(/i 2 (x0,-)) <M C ^M. 

Combining these facts by Lemma 1.2, we may therefore find a set Y" 2 C Y" 2 of 
cardinality 

(139) #F 2 > M~ C ^M 
such that 

(140) fi 2 (4, y 2 ) > M'^M 1 / 2 for all y 2 G F 2 . 
For each y 2 in Y^', define the set 

^4(y 2 ) = {(xo,y ) e Y"o : (x ,y ,x^,y 2 ) G 0'}. 
By (124) these sets are disjoint as y 2 varies. By (140) these sets have cardinality 

(141) #A(y 2 ) > M-^m 1 / 2 . 
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By (125) these sets are contained in the row {h(x 2 , 2/2)} x ^o- Combining these 
three facts with (120) we thus see that 

#{y 2 G Y 2 ' : /i(x§,y 2 ) = x } < M c ^ for all x . 

We may therefore find a subset Y 2 °f ^2 with cardinality 

(142) #Y 2 " > M~ cv ~ e M 

such that the map h(x 2 , •) is injective on Y 2 . 
We now define the set ^1 by 

^1 = < (x ,y ,x 2 ,y 2 ) : (x ,y ) G (J A(y 2) 

By construction we have A(y 2 ) C S for all y 2 G F 2 . Thus from (137), (141), 

(142) and the disjointness of the A(y 2 ) we have 

> M- C ^MV 2 # [J A(y 2 ) > M-C^M^M-C^MM-C^M 1 ' 2 
which is (131). 

We now show (132). Let (xo, y , x 2 , y 2 ) be an element of £7i . Then by 
construction there exists a y' 2 such that (xo, y , x 2 , y' 2 ) G 0'. By (125) we have 

x = /i(x 2 ,y 2 ) = /i(x^,y' 2 ), 

but by (124) we have y 2 = y' 2 . Thus (133) follows by setting h(y 2 ) = /i(x 2 ,y 2 ). 
The claim (132) then follows since h is injective by construction. Finally, the 
claims (134) and (135) follow from (123) and (126) respectively after applying 
(133). □ 

Having strengthened the functional relationship, the next step is to reduce 
the number of values that x 2 + y and /(y 2 ) can take. More precisely: 

Proposition 12.2. Let the notation be as in the previous propositions. 
Then one can find a subset S7 2 of Qi with cardinality 

(143) #n 2 > M~ C ^M 2 
such that, if A and B denote the sets 

A = {x 2 +y : (x ,y ,x 2 ,y 2 ) G £1 2 } 

and 

B = if (72) '■ ( x o,y > x 2,y 2 ) e ^2}, 

then 

(144) #A, #B < M c ^~ e M 1/2 . 
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Proof. Define the multiplicity function fii(a) on Z by 

Mi («) = {( x 2,Yo) e X 2 x y : x 2 + y = a}- 
We consider the problem of estimating the cardinality of 

(145) {(x ,y ,x 2 ,y 2 ) G Ox : Mi(x 2 + y ) < M-^M 1 ! 2 ) 

where Cj is a constant to be chosen later. 

First consider the contribution of a single y 2 G I2 to (145). The quantity 
x is fixed by (133). By (134) there are < M c ^ l M 1 l 2 possible values of x 2 + y 
which contribute. Thus the total contribution of y 2 to the cardinality of (145) 
is at most 

(146) M c ^ e M 1 / 2 M- C7 ^ e M 1 / 2 . 
By (122) we therefore have 

#(145) < M cv ~ e M~ Cr ^M 2 . 
Similarly, define the multiplicity function [12(b) on Z by 
A*2(&) = {V2 G Y 2 : f(y 2 ) = b}, 
and consider the cardinality of 

(147) {(x ,y ,x 2 ,y 2 ) G ^ : fi 2 (f(y 2 )) < M~ C ^M 1 ' 2 ) . 

We consider the contribution of a single (x 2 ,y ) 6 X 2 x lo to (147). By (134) 
there are < M C ^M 1 / 2 possible values of /(y 2 ) which contribute. Since xo 
is determined by (133), we see that the contribution of each (x 2 ,y ) to the 
cardinality of (147) is at most (146) as before. From (121) and (120) we thus 
have 

#(147) < M C ^M~ C7 ^M 2 . 

Define £l 2 to be the set £li with (145) and (147) removed. By (131) we 
see that (143) holds, if we choose C7 large enough. By construction we have 

/ii (a) > M~ C ^M 1 ' 2 for all a G A 

and 

M2(&) > M- c ^ e M 1 / 2 for all be B. 

On the other hand, from (121), (120), and (122) we have 

IMi = #X 2 #Yo < M C ^M, Win || ! = #y 2 < M C ^ M , 

and (144) follows from Chebyshev's inequality. □ 

Now we can finally apply Bourgain's lemma. Let G denote the subset of 
Ax B given by 

G = {(x 2 + y ,/(y 2 )) : (x ,y ,x 2 ,y 2 ) G tt 2 }. 
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From (143) and (135) we have 

#{a - b : (a, b) e G} > M- C ^M. 
On the other hand, from (134) we have 

#{a + b : (a, 6) 6 G) < M^M 1 / 2 . 

Finally, the cardinality of A and B is controlled by (144). If we now ap- 
ply Lemma 2.54 of [4] (or Lemma 14.1 in the Appendix of this paper) with 
we obtain a contradiction, if e is sufficiently small. Thus there 
are no Besicovitch sets of Minkowski dimension < 5/2 + e, if e is a sufficiently 
small absolute constant. This (finally!) completes the proof of Theorem 0.4. 

□ 

13. Remarks 

• A large part of the above argument works for any dimension d, providing 
that one has an X-ray estimate at dimension d to begin with. This raises the 
possibility that one can obtain a much better result than Theorem 0.4 by 
a bootstrap argument. At present, however, there are two obstructions to 
this idea. Firstly, the conclusion of Theorem 0.4 is a Minkowski dimension 
statement, which is considerably weaker than the X-ray estimate hypothesis. 
Secondly, the additive combinatorial portion of the argument is only effective 
at dimension close to 5/2. 

• We now consider the question of what happens when we relax the 
direction-separated assumption on the collection of tubes T. Our discussion 
shall be fairly informal. 

Definition 13.1. A family of <5-tubes T is said to obey the Wolff axioms 

if: 

• For any 5 < a < 1 and any cr-tube T a , there are at most iS a 2 5~ 2 tubes 
in T which are contained in T a . 

• For any S < a < 1, any cr-tube T a , and any plane ir there are at most 
£ a5 _1 tubes in T which are contained in T a n Ncs(^)- 

Clearly, any direction-separated collection of tubes obeys the Wolff ax- 
ioms. The Kakeya results in [17] extend easily to these types of collections; for 
instance, we have 

(148) | |J T| > 5 1/2 

TGT 
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whenever T is a collection of tubes with cardinality « 5~ 2 that satisfies the 
Wolff axioms. In contrast, the arguments in Bourgain [4] do not say anything 
concerning this generalized setting in any dimension, because these arguments 
crucially rely on the direction-separated nature of T. For similar reasons the ar- 
guments in our paper also do not adapt to this setting even if one can somehow 
attain the analogue of stickiness. The estimate (126) in particular is critically 
dependent on the direction-separation of T. 

One may then ask if there is any improvement to (148) if one only assumes 
that T satisfies the Wolff axioms, or a similar condition which is weaker than 
direction-separation. It turns out in fact that there is no such improvement in 
three dimensions if one replaces the underlying field M by C, and multiplies all 
relevant exponents by a factor of 2. Indeed, if we take E to be the unit ball of 
the Heisenberg group: 

{{zi,z 2 ,z 3 ) G C 3 : lm(z 3 ) = Im(zi^), \z x \, \z 2 \, \z 3 \ < lj , 

then we see that E has Minkowski dimension 5, and for any a, b G M. and ioGC 
with |a|,|fr|,|u>|<l the (complex) line segment 

{(z, w + az, zw + b) : z G C, \z\ < 1} 

is contained in E. We can then find a collection T of complex 5-tubes in 
Ncs{E) with cardinality ~ <5~ 4 which satisfy the complex version of the Wolff 
axioms, and such that 

| (J r| ~ s. 

TeT 

Informally, the collection T is also sticky, plany, and grainy in the spirit of 
the above argument. By "stickiness" we mean that the tubes T can be covered 
by a collection T p of p-tubes which are not direction-separated, but satisfy 
the Wolff axioms and have cardinality ~ p~ 2 . The planiness and graininess 
properties are best seen by first verifying them at the origin (0, 0, 0) (in which 
case the relevant plane is the z 3 = plane) , and then using the transitive action 
of the Heisenberg group. One can find a set which obeys all the properties of 
Proposition 11.1 (with the trivial distinction that the lattice Z is replaced by 
the Gaussian integer lattice Z[i]) except for the crucial (126), which is lacking 
because T is not direction-separated. Namely, we may take 

X = Y 2 = {a + bi G Z[i] : \a + bi\ < M 1 / 2 }, 
X 2 = Y = {a G Z : \a\ < M 1 / 2 }, 

n = {(x ,y ,x 2 ,y 2 ) G X x Y x X 2 x Y 2 : x = y^}, 

s(x ,y ) = 

fr(x2,y 2 ) = Y2- 

We remark that the set (130) in this example resembles Figure 9 rather than 
the set in Figure 7. In particular, (126) fails dramatically. 
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The counterexamples over C are based on the fact that C contains a 
subfield of half the dimension. It seems natural to ask whether similar objects 
exist in M; for instance, Erdos [5] has posed the question of whether a Borel 
subring of R exists with Hausdorff dimension 1/2. These types of questions are 
related to the distance set conjecture of Falconer [6] and are also connected 
to the Furstenburg problem mentioned in [19]. Two of us will pursue this 
connection in further detail in [9]. 

• Our main argument relied on only three slices of the Besicovitch set, and 
discovered that sets such as X2 and Yq had very good additive and subtractive 
properties. One may naturally ask whether one could gain more information 
by looking at more slices. For instance, one can try to utilize planiness at the 
slice z = (zo + Z2)/2; this property was not exploited in the arguments above. 
This property in fact leads to an interesting observation, that a large portion of 
the set -E[Tp] is approximately invariant under a (5-separated set of translations 
with cardinality p" 1 / 2 in the (1, 1,0) direction, and can be used to provide an 
alternate proof of the contradiction obtained in the arguments above. 

If one exploits planiness and graininess on other horizontal slices, one can 
also discover multiplicative properties on X2 and Yq. Very roughly speaking, 
the sets X2 and Yq are affinely equivalent to a p-separated set of diameter 
0(1) which is an approximate ring, in the sense that it is often closed under 
addition and multiplication (up to an uncertainty of p) . Thus there is (a very 
weak and heuristic) converse to the connections between 1/2-dimensional fields 
and 5/2-dimensional Kakeya sets mentioned earlier. 

We hope to detail these results in a future paper. 

• Define a Nikodym set is a subset E of M 3 such that for every x 6 M 3 
there exists a line I containing x such that I n E contains a unit line segment. 
In analogy with the Kakeya conjecture, it is conjectured that all Nikodym 
sets have full dimension. At first glance, the above results do not appear to 
have any bearing on Nikodym sets as the associated collection of <5-tubes is 
not direction-separated. However this is easily resolved, as one can apply a 
projective transformation to turn a Nikodym set into a Kakeya set of the same 
dimension, and so Theorem 0.4 is also valid for Nikodym sets; see [13]. 

• It seems likely that these techniques could be extended to higher dimen- 
sions and give an improvement on (1) for all n > 3. Indeed, the fact that the 
codimension of a set with dimension (n + l)/2 increases rapidly with n is a 
favourable sign, especially when compared with the analogue of the graininess 
property in higher dimensions. Of course, for n sufficiently large (n > 8) the 
estimate (2) is superior to (1) anyway. Two of us have completed the first step 
of this program in [10], in which the results in [18] are generalized to higher 
dimensions. 
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14. Appendix: A lemma of Bourgain 

In this section we prove a weakened version of Lemma 2.54 of [4], namely: 

Lemma 14.1 ([4]). Let N » I, < e < 1 and let A, B be subsets of Z 
such that 

(149) #A, #B < N. 
Then, if G is a subset of A x B such that 

(150) #{a + b : (a,b) G G} < N, 
then we have 

(151) #{a-t:(a,6)eG}<JV 2 - £ 

for some sufficiently small absolute constant e > 0. 

In [4] this lemma was proven with the explicit constant e = 1/13. Our 
argument is copied from [4] but is slightly simplified as we are not trying to 
optimize the constants. The argument in [4] is based in turn from the work 
of Gowers [7] . Recently, two of the authors have managed to improve this to 
e = 1/6, see [8]. It is not known what the best value of e is, but one must have 



log(6) 

e < 2 - — = 0.36907. .. ; 
log 3 



see [8], [12]. 



Proof. Let e be a small constant to be chosen later. Suppose for con- 
tradiction that we had sets A, B, G for which (151) failed. Then we clearly 
have 

(152) #G«iV 2 ; 

in this section the £ notation will be used with 5 replaced by iV -1 . 

Proposition 14.2. Let A, B, G satisfy (152) and the hypotheses of 
Lemma 14.1. Then there exist ps N numbers d such that 

(153) #{{a,b)eAxB:a-b = d}^N. 



A number d which satisfies (153) shall be called an (^4, B)-popular differ- 
ence. 

Proof. The upper bound in (153) is clear from (149), as is the upper bound 
on the number of popular differences. Thus it suffices to show the lower bound 
in both cases. 
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Let X denote the set in (150). From (152) we have 
\\xa*Xb\\lhx) > #G>N 2 . 
From (150) and (9) with d = 2 we thus have 

IIxa*xbI|2 £iv 3/2 . 

From the identity 

Wxa * xbIU = Wxa * x-Bh; 

we thus obtain 

J( X a*X-b) 2 ZN 3 . 
On the other hand, from (149) we have 

/ XA*X-B £N 2 ; 



thus 

/ iXA * X-B? % N 3 

for appropriate choices of constants in the j§ symbols. On the other hand, from 
(149) again we have 

Wxa * X-b||oo ^ ^V- 
Combining the two estimates we obtain 

#{XA*X-B^N}N 2 >N 3 , 
and the claim follows. □ 
Definition 14.3. If a, a' £ A, we say that a and a' (A, B)- communicate if 
#{6 e B : a - b, a' - b are both (A, ^-popular differences} > N~^N. 

It is quite plausible that under the conclusions of Proposition 14.2, that 
many pairs in A communicate. We shall go even further, and show that we 
can find a refinement of A for which almost all pairs communicate. 

Definition 14.4. A refinement of A is any subset A' of A such that 
#A> « #A 

Proposition 14.5. Let A, B be sets satisfying (149) and the conclusion 
of Proposition 14.2. Then there exists a refinement A' of A such that 

Va' £ A' : [\/a £ A' : a and a' (A, B)- communicate] . 
a' a 

Here the notation V is as before, but with 5 replaced by N^ 1 . 
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Proof. Let 7 C A x B be the incidence matrix 

I = {(a,b) G A x B : a — b is a (A, 7?)-popular difference}; 
by hypothesis we have 

(154) #I^N 2 . 
We define the rows I a and columns I b of 7 by 

I a = {beB: (a,b) £7} 

and 

/ = {o £ i : (a, 6) £/}. 

From (154) we have 

beB 

By (149) and Lemma 1.2 one can then find a refinement 7?' of 7? such that 

(155) #I b « JV for all 6 G 

Fix 73', and let Y" C i x A be the set of noncommunicative pairs; i.e. 

Y = {(a, a')eAxA: #(7 a n I a >) < N'^N}. 

Unfortunately, Y may be quite large. However, for many beB' the set 
(I b x I b ) n Y is quite small. Indeed, we have 

E#(U 6 x7 6 )ny)= E #(i a ni a ,)<N-^N(#A) 2 <n~^n 3 

beB' (a,o')ey 
by (149). On the other hand, from (155) we see that 

(156) ]T #(7 6 x 7 6 ) « AT 2 #£' « iV 3 . 

66 B' 

Thus we have 

V 6 G 7?', a G 7 fc , a' & I b : (a, a') Y. 
6, a, a' 

By Corollary 5.9 it follows that 



yb£B':\ V a€ I b ,a' G 7 6 : (a, a') Y 



6 



a, a 



so that we may find a b € B' such that 



Vaef.a'ef : (a, a') Y. 



a, a' 



Fix this b. By Corollary 5.9 again we have 

Va' € 7 6 : [ Va G 7 6 : (a, a') Y 
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The claim then follows by setting A' to be those elements of I b for which 

Va e I b : (a, a') g Y 
a 

holds. □ 
A variant of the above proposition is 

Proposition 14.6. Let A, B be sets satisfying (149) and the conclusion 
of Proposition 14.2. Then there exists a refinement B' of B such that for all 
b' £ B' there are f» N numbers a £ A for which a — b' is a (A, B)-popular 
difference. 

Proof. As in Proposition 14.2, the upper bound on the number of a is 
trivial from (149). 

For each b' 6 B let n(b') denote the quantity 

fi(b') = #{a G A : a — b' is a (A, -B)-popular difference}. 

Then by hypothesis 

On the other hand, from (149) we clearly have 

SUpp(//), ||/i||oc ^ N. 

The claim then follows from Lemma 1.2. □ 
We now are ready to give the key proposition. 

Proposition 14.7. Let A, B, G satisfy (152) and the hypotheses of 
Lemma 14.1. T/ien if e is sufficiently small, there exists refinements A' and B' 
of A and B respectively such that 

(157) \G D (A' x B')\ > N 2 
and 

(158) \{a! - b' : (a', b') Gi'x B'}\ < N c ^~ e N. 

Proof. Define the multiplicity ^(a) on yl by 

flG (a)=#{kB:(a,6)eG}. 
Then from (152) we have 

IImgIIi ~ N 2 

Thus by (149) and Lemma 1.2 we can find a refinement A\ of A such that 

(159) n G (a) w A" for all a G A x . 
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Fix this A 1 . Note that #(G n (A 1 x B)) > N 2 . By Proposition 14.2 there 
are §; N (Ai, B)-popular differences, so that by Proposition 14.5 there exists 
a refinement A' of A\ such that 

(160) Va G A' : a', a (Al, B)-communicate 

for all a! G A'. 

Fix A'. From (159) we have 

#(G n (A' x B)) « N#A' « iV 2 . 

Thus if we define the multiplicity ,u G (6) on B by 

^)=#{ fl ei':(a,6)eG}, 

then 

ll/« G ||i ^^ 2 - 

By another application of (149) and Lemma 1.2 we can find a refinement B\ 
of B such that 

(161) fi G (b) « iV for all 6 G Bi. 

Fix this Bi. By Proposition 14.6 there exists a refinement B' of Bi such 
that for every b' G B' there are « iV a G A' such that a — b' is a (A', Bi)-popular 
difference. 

Fix B'. Combining this with (160) we conclude (if e is sufficiently small) 

Fact 14.8. For every a' G A' and 5' G B', there are ~ elements a £ A' 
such that a', a (A, B)-communicate, and a — b' is a (A, B)-popular difference. 

We can now finish the proof of Proposition 14.7. Since (157) follows from 
(161), it suffices to show (158). 

Fix a' , b' . From the above fact and the definition of communicativity, one 
can find N~ C ^N 2 pairs (a, b) G Ax B such that a' — b, a — b, and a — b' 
are (A, B)-popular differences. From the identity 

a' -b' = {a' -b)-(a-b) + (a- b') 
and the definition of a popular difference, we thus obtain 

#{(01,02,03,61,62,63) G A 3 xB 3 :a'-b' = (oi - 61) - (a 2 - 6 2 ) + (o 3 - 63)} 

> N~ C ^N 5 . 

Since there are only « ./V 6 sextuplets (01,02,03,61,62,^3)) (158) follows. □ 

We can now prove Lemma 14.1. We may refine G so that the map (a, 6) 
a — 6 is injective on G without affecting the hypotheses of the lemma. But this 
injectivity is clearly incompatible with the conclusions of Proposition 14.7 if e 
is sufficiently small. This is our desired contradiction. □ 
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